lecture 1 


Topics: 

What is physics? 

What is classical mechanics? 

Degrees of freedom 

The Art of Theoretical Physics 

Motion, trajectories and F — ma 

F = ma -v^ two initial conditions/degree of freedom 

Finding trajectories numerically 

Forces of the form F(t) 

What is physics? 

According to Wikipedia 

Physics (from the Greek, (f>vcn( (physis), “nature” and (imcnnoQ (physikos), “natural”) 
is the science of Nature, from the quarks to the cosmos. Consequently, physics treats 
of the fundamental constituents of the universe, the forces they exert on one another, 
and the results produced by these forces. Sometimes, in modern physics, a more 
sophisticated approach is taken that incorporates elements of the three areas listed 
above; it relates to the laws of symmetry and conservation, such as those pertaining to 
energy, momentum, charge, and parity. 

As much as I like symmetry and conservation laws (which we will certainly talk a lot about 
in this course) I am not sure that I like this definition much. It focuses a little too much on grand 
theoretical principles and not enough on how useful physics is. Both poles of physics - the theo¬ 
retical and the experimental/applied are crucial to the way physics works. True - physics addresses 
the most fundamental questions in the universe - but we can address these questions usefully only 
because we can use physics to build the tools required to answer the most difficult questions. And 
the answers to these questions, in turn, allow us to build better tools and ask deeper questions. This 
self-containedness - this cyclic give and take between mathematical theory and tool-building is the 
glory of physics itself, and the reason why physics is important to all other areas of science. Physi¬ 
cists build tools, both technological and mathematical, that are crucial to the pursuit of scientific 
questions in all fields and which form the basis for much of our mathematics and technology. 

And besides, physics is just a marvelously fun way of looking at the world. 

What is classical mechanics? 

Classical mechanics is a category defined by what it is not! It is not quantum mechanics. A classi¬ 
cal mechanical system is any collection of objects that we can describe to a good approximation 
without worrying about quantum mechanics. This includes most of the systems you are used to in 
everyday life (if you don’t worry too much about what goes on inside your computer or CD player 
or TV set). In this course, we will discuss these mechanical systems, and we will push beyond 
your everyday experience to discuss what happens when objects move at speeds close to the speed 
of light. We won’t study quantum mechanics explicitly, but we will talk about it at times. The 
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underlying laws of the Universe seem to be quantum mechanical, and it is fun, instructive, and 
not at all trivial to think about how classical physics emerges as an approximation to the quantum 
mechanical world. 

Degrees of freedom 

We will start slowly, with some useful definitions. 

Coordinates — The coordinates of a physical system are the numbers (possibly dimensional) 
that describe the system at a given fixed time. 

Frame of reference — The coordinates we use to describe the system are always somewhat ar¬ 
bitrary, depending on conventions. Mathematically, we should be able to change our system 
of coordinates and still describe things. But in physical problems, it is usually easiest to 
restrict the allowed coordinate systems. Fortunately, in physics, there is an obvious way of 
specifying the conventions that we use in a physical problem. We simply describe in detail 
how to measure the coordinates at any given time. Such a description is called a frame of 
reference. We will not always mention the frame of reference explicitly, because we have 
in mind something very intuitive and simple - the set of clocks, measuring sticks and other 
stuff that we need to study classical phenomena in a laboratory. But we will see (particularly 
when we discuss relativity in a few weeks) that it is important to have a frame of reference 
that fixes our conventions. Otherwise, we can get into all kinds of problems. 

Configuration — The configuration q(t) of a mechanical system is a number or vector consist¬ 
ing of values of a set of independent coordinates that completely describe the system at time 
t. 

We will call the configuration q, without specifying (at least for another few seconds) whether q is 
a single variable, or some kind of vector describing several coordinates at once. The word “inde¬ 
pendent” in the definition means that none of the numbers in our set of coordinates is redundent or 
dependent of the others. That is we asume that the coordinates in the configuration are indepen¬ 
dent in the sense that each can in independently changed, and each different set of values describes 
something physically different. Given a set of values of the coordinates at some particular time, 
we can figure out what the configuration is and thus what the system looks like at that time. Thus 
a configuration is just a mathematical snapshot of the system at a given time. 

One way of describing the underlying problem of classical mechanics is that we want to under¬ 
stand how the configuration of the system evolves with time. That is we want to put the snapshots 
together into a mathematical movie to describe how the system moves. 

Degrees of freedom — The number of independent components of the configuration q is called 
the number of degrees of freedom of the mechanical system. 

The number of degrees of freedom is the number of independent ways in which the system can 
move. Here are some examples. A point mass sliding on an airtrack, described by only a single co¬ 
ordinate, has one degree of freedom. We get more degrees of freedom if we go to more dimensions 
or to more complicated objects. 
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system 

coordinate 

# of DOFs 

point mass on a track 

t (distance along the track) 

1 

point mass on a flat surface 

(x,y) 

2 

point mass in 3-d 

r = (. x,y,z ) 

3 

rigid body in 3-d 

f of center + 3 angles 

6 

2 masses + massless spring 

f) and f ‘2 

6 

2 masses + massive spring 

fi, r 2 and spring 

“(X)” 


A continuous massive spring formally has an infinite number of degrees of freedom, because to 
specify its configuration we would have to give a continuous function describing how much every 
point on the spring is stretched. Really, of course, a physical spring has a finite but very large 
number of degrees of freedom, because it is not actually continuous, but is made up of atoms. But 
the difference between oo and Avogadro’s number is often not very important. 

This brings up an important philosophical point. What the heck is a “point” mass? What is a 
“rigid” body? What is a “massless” spring? Most of you have probably been dealing with physics 
problems for so long that you are used to these phrases. But it is important to remember that 
these are mathematical idealizations. Real physical systems are complicated, and in fact, what we 
choose for q may depend on what kind of physical questions we want to ask, what level of accuracy 
we need in the answer, and even how long we want to study the system. 

So for example, for a hockey puck sliding on the ice at the Boston Garden, we might decide 
that the configuration is specified by giving the x and y coordinates that determine the puck’s 
position in the plane of the ice. Then q would stand for the two dimensional vector, (. x,y ). But 
if we do this, we have ignored many details. For example, for a shot that comes off the ice, we 
would need to include the z coordinate to describe the motion of the puck. For some purposes, 
we would also need to include descriptions of the puck itself. For example, we have not included 
an angular variable that would allow us to specify how the puck is turned about its vertical axis. 
This is probably good enough for most problems. But sometimes, more information is required 
to give a good description of the physics. For example, if we wanted to understand how a rapidly 
rotating puck moves, we might need this more detailed information. We could also go on and 
describe how the puck might deform when hit by the stick, and so on. We could include more and 
more information until we got down to the level where we begin to see the molecular structure of 
the rubber of the puck. At this point, we begin to see quantum mechanical effects, and classical 
mechanics is no longer enough to give an accurate description. 

The Art of Theoretical Physics 

This is a good lesson. The coordinates that we use to describe the system may depend on what 
kind of information we want to get out of our mechanical model of the system, and how accurately 
we want our model to reproduce reality. We usually will not go over these niceties each time we 
discuss a system, but they are important to remember. There is really a very important point here. 
In physics courses, we frequently discuss “toy” systems which are obviously oversimplified, in 
which we have clearly left out features that are important in the “real world.” This is not something 
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to apologize for. This is precisely the art of theoretical physics. We work hard to abstract the 
essential physics of a system, without including things that don’t matter at the level of description 
that we are interested in. This down-to-earth ability to focus on the crucial parameters is far more 
important than fancy mathematical gymnastics. 

In fact, I believe, getting better at this art is one of the most useful things you can get out of 
this course. It is generally useful far beyond this or future physics courses. The ability to build 
mathematical models of phenomena is crucial to many fields. But models can be as misleading as 
they can be useful unless they focus on the right parameters, and unless the modeler is aware of the 
model’s limitations. Physics is the paradigm for this kind of thinking. This is one of the reasons 
why, over the years, trained physicists have been so much in demand in very different fields. 

Motion, trajectories and F — ma 

Trajectory — A trajectory is a possible motion of a physical system. 

We describe a trajectory by giving the value, q(t) of the coordinates of the system as a function 
of time. There are an infinite number of possible trajectories for any interesting physical system 
(that is, unless everything is nailed down and completely fixed). Our everyday experience tells us 
that knowing the configuration of the system at some time is not enough information to allow us 
to calculate the trajectory. A snapshot doesn’t tell us how the system is moving. We also need to 
know the velocity of the system — the derivative of q with respect to time. We will sometimes use 
the notation of a dot over an object to represent a time derivative: 

m = |?(t). a) 

If q describes more than one coordinate, then q describes the same number of independent veloci¬ 
ties - but we will continue to call q the velocity, even if it has multiple components. If q is a vector, 
q is the same kind of vector. For the hockey puck, 

q = (x,y), q = (x,y). (2) 

Then we can say that if we know q and q at a particular time, we should be able to calculate the 
trajectory. For example, if we know where the hockey puck is and how fast and in what direction it 
is moving, we can plot its trajectory by just assuming that it continues to move at a constant speed 
along the line determined by its position and velocity. 

Something important has happened. I have snuck in one of the critical assumptions of mechan¬ 
ics. Let me say it with the formality it deserves. 

A — The trajectory, or subsequent motion, of a mechanical system is com- (3) 
pletely determined if we know the configuration, q and the velocity, q at some 
given time. 

Because the number of components of q and of q are both equal to the number of degrees of 
freedom, this means that we have to specify two constants per degree of freedom in order to 
specify how the system moves. This is a biggie. We will come back to this again and again, as you 
will see. 
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One of the central problems of mechanics is to calculate what these trajectories are for various 
systems — that is to say, to figure out how things move. To begin, let us think about what this 
process is like for a system specified by a single coordinate, x. Newton tells us that to figure out 
how x changes as a function of time, we define our coordinates in an appropriate inertial frame 
(we’ll talk much more about what this means later) and use his second law 

F — ma (4) 

where F is the force on the object, m is its mass, and a is its acceleration (also sometimes written 
as x — one dot for each time derivative) 


= x (5) 

We will have much more to say about mass, but for now, we will just assume that m is a fixed 
property of the object. The force, F, is more complicated. In general, F will depend on what the 
system is doing. Since we have already assumed that all we need to know about the system at a 
given time is q and q, in this case x and x, all F can depend on is q, q and t. So in general, F 
at time t is some function of x and x, at that same time, and t, F(x(t),x(t),tj. Then Newton’s 
second law becomes a formula for the acceleration: 

x — — Fix, x, t) (6) 

m 

Note that in general, in Newton’s second law, the force is a function of all three variables, x, 
x, and t. Because of things l ik e this, we will often have to discuss the calculus of functions of 
several variables. I realize that some of you are just beginning to study this subject formally in 
math courses. In general, in this course, I will often make use of mathematics that many of you 
have not seen. We won’t be using any deep properties of the subject, just things that you would 
guess immediately from your knowledge of calculus of a single variable, vectors, and algebra. The 
proofs can wait until (or if) you take the appropriate math course. The important thing is that you 
try to understand the physics, and not faint when unfamiliar mathematics is thrown at you. 

F = ma two initial conditions/degree of freedom 

Now the first deep question I want to address is this: Why should the fundamental law of me¬ 
chanics (which Newton’s second law certainly is) be a formula for acceleration? A partial answer 
comes from the mathematics. A formula like (6) is called a second order differential equation 
because it involves a second derivative but no higher derivatives. Our mathematician friends tell 
us that a second order differential equation for x has in general an infinite number of solutions, 
which can be labeled by the values of x and x at some given time. These conditions that specify 
the solution are sometimes called the initial conditions, and this is certainly appropriate for the 
physics applications we have in mind. Once we specify the initial conditions, the initial position 
and velocity, the solution of the differential equation, which is the trajectory, is completely spec¬ 
ified. This means that if the fundamental law is a second order differential equation, like (6), the 
mathematics guarantees that assumption A is satisfied. Thus we have shown that F = ma implies 
A. This is a good thing. 
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In general, I am not so interested in proofs in this course, at least not in the mathematical sense. 
However, in the notes, and in lecture if I have time, I will give a “physicst’s proof” that the solution 
of a second order differential equation is specified by q and q. 

Here is a simple and probably very familiar example. Suppose that the force is constant, 


F(x, x, t) 


Fo 


d 2 

ma = m —rX — m x 
dt 2 


The solution for x(t ) can be written in terms of x and x at t = 0 as 


(7) 


x(t) = p^-t 2 + T(0) t + x(0) (8) 

2m 

We will see one general way of getting this result later. But is easy to check that (6) is satisfied by 
computing the second derivative. 

F n 

x(t) = —t + T(0) (9) 

m 

x(t) = — (10) 

m 

This shows that the t 2 term is right. You can check that the other terms in (8) are correct by setting 
t — 0 in (8) and (9). 

The latest version of Mathematica has a fantastic function that allows you to input initial con¬ 
ditions in real time. Check out the notebook lecture-l-l.nb. 

There is nothing special about t = 0. We could just as well write the solution in terms of the 
position and velocity at some arbitrary time t 0 , as 


x(t) = F°-(t - t 0 ) 2 + x(t 0 ) (t - to) + x(t 0 ) (11) 

2 m 

I hope that I have convinced you that F = ma implies two initial conditions/degree of freedom. 
That doesn’t mean that Newton’s second law is equivalent to A because we haven’t show that any 
system that requires two initial conditions/degree of freedom satisfies F = ma. But there is a 
mathematical sense it which this true at least for one degree of freedom. I am not going to talk 
about this in lecture, because the argument is a slightly intricate exercise in multivariable calculus. 
But it may be fun for those of you who like that sort of thing, so I have included it as an appendix 
to the lecture notes. 

What have we really learned from this mathematical philosophy? I have argued that Newton’s 
second law is actually equivalent to assumption A — each implies the other. Of course, we haven’t 
really understood why F is equal to ma, because we still don’t know why assumption A is true. 

That leads naturally to another question. Must A be true for some reason? Can we imagine a 
world very different from our own in which it is not? Actually, I don’t think A must be true. In 
fact, I don’t really think it is true. Like most of the things we teach as “physical laws”, it is only an 
approximation to what is really going on. We will come back to this much later. But this is a nice 
example of the kind of thinking that we will be doing a lot of in this course. 
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Should you take Physics 16? 

Because this is the first day of the class, you are all full of adrenaline and are paying close attention 
to every word I say. But normally, at about this point in a 1.5 hour lecture, you would all be 
falling asleep and would need a little something to perk you up. This is where we normally put 
a miniexam, a small problem that you work on with a random group of other students. Since we 
haven’t done enough yet to warrant a miniexam, I thought that instead, I would say a bit about the 
course, and then see whether you have any questions and maybe ask you some questions myself. 
There is a lot more detailed information on the information sheet on the web page. I work hard to 
keep it updated, so please read it carefully. 

Presumably you are all here because you have good preparation in physics and math and you 
are afraid that Physics 15a would be boring. The first thing to say is that Physics 15a would 
probably not be boring. It is a good rigorous course, and almost all of you would leam something. 
However, for many of you, Physics 16 may be more fun. 

As you may have guessed from what has happened so far, I will spend a lot less time on statics, 
assuming you already know about adding up vector forces and torques on objects that are not 
moving. We have already started looking at things moving around. In equations, I won’t spend 
time thinking about F — 0. We are going right to F — m a. 

This will give us some time to do cool things that are not done in a good conventional college 
physics course like Physics 15a, including the following: 

1. linearity and normal modes 

2. Lagrangian mechanics and Noether’s theorem 

3. relativistic strings 

4. the moment of inertia tensor 

5. inflation before the big bang, dark matter and dark energy 

We won’t explore any of these ideas in the depth that they deserve. For that you will have to take 
more advanced physics courses. But neither will we look at them qualitatively or superficially. 
From each of these subjects, we will try to extract some beautiful and important pieces that we can 
analyze honestly and in detail. 

I would make one request. Don’t take 16 because you think that it is going to be hard for you 
and you want a challenge. And don’t drop it because you think it is going to be hard for you and 
you don’t want a challenge. In fact, Physics 16 is in many ways a very humane course. We will 
do civilized things like having a take-home component of our exams so that we can correct for 
problems that some people have with time pressure on exams. And while the material is hard, 
as long as you get the help that we are eager to give you, the work is really not very different 
from that in 15a. For some of you it may take a little more time, so as I say at the end of the 
information sheet, it is important to think about it in the context of your whole schedule, curricular 
and extracurricular. 
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Finding trajectories numerically 

If you are given a complicated force law, depending (for one degree of freedom) on x, x and t, you 
will usually not be able to solve for the trajectories in terms of known functions. However, you 
can always find them by brute force, solving F = ma numerically. One simple way to do this is 
to make small time steps and use the most important formula in Physics — the Taylor expansion. 
Suppose that you are given the initial conditions, x(t 0 ) and x(t 0 ). Then you can find x and x 
approximately for a slightly later time, f 0 + At, by Taylor expanding the function x(t 0 + At) 
around f 0 . The Taylor expansion looks l ik e this: 

x(t 0 + At) — x(t 0 ) + At x(t 0 ) 4 - ( 12 ) 

where the • • • are terms with more powers of At. I love the Taylor expansion, and we are going 
to come back to it many times in the course, and later we will talk about the higher order term. 
But here, these can be ignored if At is sufficiently small. This first term in the Taylor expansion is 
nothing more than the definition of the derivative: 

x(t 0 + At) — x(t 0 ) + At x(t 0 ) -f- x(t 0 ) = 1 — + _|- ( 13 ) 

How does this help us find x(t). This is fine as far as it goes, because we are given the initial 
conditions for x(t 0 ) and x(to). But what we would l ik e to do is to be able to iterate this procedure 
and find x(t 0 + nAt) which would mean that we could find x(t) (at least approximately) at a 
whole sequence of future times. And in the next step we have a problem because we don’t know 
x(to + At) from the initial conditions. But now the key point is that we can also keep track of x 

x(to + At) — x(t 0 ) + At x(t 0 ) -4- (14) 

And we know x(t 0 ) from F = ma: 

x(t 0 ) = a(t 0 ) = — F(x(t 0 ),x(t 0 ),t 0 ) (15) 

m 

You can always do this for F — ma. Given x and x at some time f 0 , you can always find x 
and x for an infinitesimally later time, f 0 + At. 

Now that you know x and x for the later time t 0 + At, you can apply exactly the same procedure 
again and compute them for t 0 + 2 At. 

Then do it again and get x and x for t 0 + 3 At. 

And so on! 

This kind of iterative procedure is ideal for a computer — the computer just has to do the 
same things over and over again to construct the approximate trajectory. And if you want your 
approximation to be more accurate, you just have to make At smaller, so that the ■ • • that you 
throw away in (12) and (14) are smaller. If you are doing this on a computer, it doesn’t matter 
how complicated the function F is. If we have time, we will play with the Mathematica notebook 
lecture-l-2.nb where is process is carried out for some simple and not so simple force laws. 

But sometimes, it is nice to have analytic expressions for the trajectories. You may be able 
to find such things if the force is a simple enough function. For the rest of this lecture and on 
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Thursday, we will give examples in which this can be done, both in general, using integration, and 
in specific cases where the integrals can be done analytically. 

Meanwhile, notice that the analysis we have just given really is a “physicist’s proof” of the 
statement that a second order differential equation has a solution that is fixed when we know q and 
q at some time. We have “proved” this by actually constructing the solution! This is the best kind 
of proof for a physicist — one that not only tells you that the solution exists, but actually shows 
you how to find it. Of course this would never satisfy a mathematician — too many loose ends? 

But note that we can “prove” the general theorem this way — that an n-th order differential 
equation requires n initial conditions. Because our original equation was a second order differential 
equation, we had to keep track of both x(t) and x(t) while we take our small time steps. In a first 
order differential equation, we would only have had to keep track of x(t), because the differential 
equation would tell us x{t) directly in terms of x(t). Then we need only one initial condition. In 
a third order differential equation, we would have to keep track of x(t), x(t) and x(t). because the 
differential equation only tells us the third derivative. Then we would need three initial conditions. 
And so on. 

Forces of the form Fit) 

The big general principle of mechanics that we have talked about today is that we need two initial 
conditions per degree of freedom to specify how a system moves. Staple this in your brains. This 
is something we will come back to and will try to understand better. But now and next time we 
will spend a bit of time discussing some examples of the different ways these initial conditions can 
appear. There are more in Chapter 2 of Dave Morin’s book. Depending on how we are doing on 
time, I may move this discussion to the next lecture. 

The simplest and least interesting example of a force law in which the trajectories can be found 
formally using integration is a force depending only on t, not on x or x. In this case, we can simply 
use the fundamental theorem of integral calculus. The acceleration is the time derivative of the 
velocity, so we can write F = ma in this case as 

a{t) = F+ V ( t ) = — F ( t ) (16) 

at m 

The fundamental theorem tells us that the general solution to (16) can be written as 1 

rt 1 

v(t) = / — Fit') dt' + constant (17) 

Jt 0 m 

There are a number of things to note about (17): 

1. We can easily verify that it works, because if we differentiate the left hand side with re¬ 
spect to t, we just get the acceleration, a(t). On the right hand side, integral calculus tells 

'Note that t' here is a dummy variable. We will discuss dummy variables in more detail below, but there is a 
possible notational confusion. The symbol t' does not mean “the derivative of t” but is just a symbol for a new 
independent variable. We could have called it s instead, but Dave’s book uses the t' notation in this way, so I am trying 
to do it in lecture as well. You will have keep your wits about you to figure out what is going on, but you can almost 
always tell from the context. If a ' appears on a function, it is a derivative. If the ' appears on a variable, it is a new 
variable. Thus f'{x) means the derivative with respect to x of the function f(x), but /( x') means the function / of 
the variable x' and f'{x') means the derivative with respect to the variable x' of the function f(x r ). 
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us that differentiating the integral with respect to t gives the integrand evaluated at t, and 
differentiating the constant gives zero, so we get 


1 

ait) = — F(t) 
m 


(18) 


which is F — ma. 

2. We need to invent a new symbol for the argument of the integrand, because the previous 
argument only works if the time dependence appears only as the upper limit of the integral, 
so we have just put a prime on t. The dt' in (17) identifies the variable t' as a completely 
independent “dummy” variable. It is a “dummy” because in the expression (17), it doesn’t 
have any particular value. It is just a symbol to indicate exactly what integral we are doing. 
This is simple but important. It has a number of consequences. Nothing depends on a dummy 
variable. For example in (17) 

—F(t')dt' (19) 

Jt 0 m 

we could just have well used some other symbol inside the integral: 



A particularly important consequence of this is that you should never ever write an equa¬ 
tion like ( 

A(t)= I B(t)dt (21) 

Jto 

This doesn’t make sense because the variable t in (21) appears in two ways — as a real 
variable and as a dummy variable — and it can’t be both. Thus (21) doesn’t mean anything 
at all, and if we see it on one of your problem sets or tests we will deal sternly with it! 


Also, the integral 


is not the same as 



( 22 ) 

(23) 


Both of these expressions are sensible, they are just not the same. In fact, we can do the first 
integral explicity because the integrand B(t) doesn’t depend on the integration variable, the 
dummy variable t', we can take it outside the integral and write - 


[ B(t) dt' — B{t) f dt' — (t — t 0 ) B(t). 

Jt 0 Jto 


(24) 


3. The integral needs some lower limit, and we have called this t 0 , expecting it to have some¬ 
thing to do with the initial conditions. 
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4. Finally, (17) appears to depend on both £ 0 and the unknown constant. But we can determine 
the constant by just setting t = t 0 , which gives 

rto 1 

v(t 0 ) = / — F(t') dt' + constant (25) 

Jt 0 m 

The integral vanishes here because the range of integration is zero, so we know that 


constant = v(t 0 ) — x(t 0 ). 


(26) 


We sometimes call the initial condition x(t 0 ) an integration constant because of the way it 
appears in (25). 


Putting (26) back into (17) gives the final result for the velocity, 

v(t) — [ — F(t') dt' + v(t 0 ) (27) 

J t 0 m 

Another way to think about the result (27) is from the following chain (really just going back¬ 
wards through the same argument): 






v(t) -v(t 0 ) 


(28) 


which is equivalent to (27). 

Now that we know v(t) as a function of t (at least formally, in terms of an integral), we can 
find x(t) by just repeating the procedure. But v is the time derivative of x, we can write 

x(t) = f v(t') dt' + x(to) (29) 

Jt 0 


where again, the initial condition appears as an integration constant. Then we can put (27) into 
(29) to get 

v(t') 




dt' + x (to) 


(30) 


Note the new variable of integration, again to keep from getting the integration variable confused 
with the range of integration. Finally, we can do the integral for the term proportional to v(t 0 ), 
because it is just a constant, and get the result 


x(t) = I [ — Fit") dt" dt' + v(to)(t — t 0 ) + x(to) (31) 

Jto Jt 0 m 

The terms in (31) are easy to understand. The last term is where the object started — the initial 
condition for the position. The middle term describes motion with constant velocity v(t 0 ), which 
is what the object would be doing if there were no force on it. And the first term is the effect of 
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the force. Notice that (31) is valid for a constant force F(t) = F 0 (a constant is just a particularly 
simple function of t). And sure enough, we can do the integral in that case and get (11). 


x{t) 


ff 

'to J t 0 

rt rt' 


Jto Jto 

1 * Fo 

Jt 0 m 
— (t 

2 m v 


1 

— F(t") dt" dt' + v(t 0 )(t — t 0 ) + x(t 0 ) 
m 

Fn 

— dt" dt' + v(to)(t — to) + x(to) 
m 

{t’ - to) dt' + v(t 0 )(t - t 0 ) + x(to) 

- t 0 ) 2 + v(t 0 ){t - t 0 ) + x(t 0 ) 


(32) 


A A Implies F = ma 

I am not going to talk about this in class, but here is a little mathematical appendix for those of you 
who are interested. At least for a single degree of freedom, assumption A actually implies, under 
very mild assumptions, the existence of a second order differential equation l ik e Newton’s second 
law for the trajectories. 

To see this, let us restate A in mathematics. Suppose that a system has one degree of freedom 
so that the configuration of the system is given by the value of a single variable x (it won’t matter, 
but it makes the language simpler to talk about only a single coordinate). Since A is the statement 
that the trajectory is determined by x and x at some given time, call the “given time” t 0 , then A 
implies that we can write any possible trajectory x(t) as a function of four variables: t (of course); 
the time t 0 at which we specify x and x; and of the “initial” values x(t 0 ) = x 0 and x(t 0 ) = t’ 0 . A 
is just the statement that such a function exists. Since it exists, we can give it a name G and write 
the statement A in mathematics as 


A x(t) = G(x 0 ,vo,to,t) (33) 

Furthermore, because we can specify x and x at any time we choose. We can use any t 0 in (33) 
and get the same trajectory so long as we take xo and v 0 to be the values of the position and the 
velocity on that trajectory at the time t 0 . This function G contains everything about all possible 
motions of the system! 

The mild assumption we need to get F = ma from this is a smoothness assumption. We must 
assume we can differentiate our function x(t) with respect to time. Obviously, if we can’t do this, 
we have no hope of getting to Newton. This assumption is really very mild. 

Now here is the basic idea. Because of the meaning of the initial conditions, we can put 
Xo = x(t 0 ) and v 0 = x(t 0 ) into (33) and rewrite it as 

x(t) = G(x(t 0 ),x(to),to, t) (34) 

This is a peculiar equation, because the trajectory x(t) appears on both sides, on one side inside 
the function G. Because of this, the equation is telling us something about the function x that 
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describes the trajectory. It is still more complicated than it has to be though, because it involves 
two different times, t and t 0 . But the other peculiar thing is that the right hand side of (34) must be 
completely independent of t 0 , because t 0 doesn’t appear on the left had side at all. We can use this 
to get information on the trajectory that only depends on a single time. In particular, now we can 
start differentiating both sides of the equation with respect to t and afterwards, setting t 0 equal to 
t to get an equation that only depends on the the trajectory and its derivatives at a single time. We 
will see that differentiating zero or one time doesn’t give us anything interesting - just tautologies. 
But differentiating twice is interesting. 

So here goes. First we set t 0 —■> t in (34). 

x[t) = G^x(t),x(t),t,t^j (35) 

This is a tautology, because we can use the time t as the initial time to specify the trajectory, and 
this just says that the particle is where it is supposed to be at time t. To put it another way, if I 
change all the ts to t 0 in (35), 

x(t 0 ) = G(x(t 0 ),x(t 0 ),to,to) = G(x(t 0 ),x(t 0 ),to,t) = x(t)\ t=tQ (36) 

this is just the statement of the initial condition for x(to). 

Likewise, differentiating once with respect to t gives us nothing new. 

d d 

x(t) = —G(x(to),x(t 0 ),t 0 ,t) = — G(x(t 0 ),x(t 0 ),t 0 ,t) (37) 

Again we can set t 0 —> t because the left hand side doesn’t depend on t 0 . The equation (37) just 
says that the particle is going as fast as it should be going at time t. 

d d d 

x(t 0 ) = — G(x(t 0 ),x(t 0 ),toAo) = — G(x{t 0 ),x{to),to,t)\ t=ta = -Q t x{t)\t=t 0 ( 38 ) 

But differentiating twice with respect to t gives us a second order differential equation that the 
trajectory must satisfy. And this is Newton’s second law. 

Q2 q2 

x(t) = QpG(x(t 0 ),x(t 0 ),t 0 ,t) = Qp G(x(t 0 ),x(t 0 ),to,t) tQ=t (39) 

The right hand side of (39) is just some function of x(t), x(t) and t. Thus as promised, is a second 
order differential equation for x(t). This is just F = ma with 

d'2 

F(x(t),x(t),t) = m Q^ G(x(to),x(t 0 ),to, t ) tQ=t (40) 

From the function that is the mathematical description of the assumption A that the trajectory is 
determined by an initial position and velocity, we can find the force that makes F = ma work. 
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lecture 2 


Topics: 

Where are we? 

Forces of the form F(v) 

Example: F(v) = — m T v 
Another example: F(v) = —m(3v |u| 

Review of the harmonic oscillator 
Linearity and Time Translation Invariance 
Back to F(v) = —m T v 
Appendix - cross products - preview 

Where are we? 

Last time, we discussed Newton’s second law — F — ma. I tried to convince you that this is 
essentially equivalent to the statement that the motion of any given classical mechanical system 
is determined by a set of initial conditions, the values of the coordinates which specify its con¬ 
figuration, and their first derivatives at any given time. I also suggested that this is a very deep 
and interesting fact about the world, and promised that we would come back to it at the end of the 
course and give at least a provisional explanation of it. 

We also discussed how to solve for the motion of a system numerically by keeping track of q 
and q as functions of time, and using the Taylor expansion and F = ma to calculate approximately 
how they change in a small time step At. By putting together many small time steps, we can trace 
out the trajectory of the system. This procedure works for any any number of degrees of freedom, 
and it should convince you that in principle, giving a second order differential equation for the 
configuration of a classical does just what expect - it determines the trajectory in terms a set of two 
initial conditions per degree of freedom. 

In a sense, our numerical analysis completely solves the problem - at least least your computer 
can construct the solution to any problem. But for us people, it is nice to have analytic solutions that 
we can use to develop our intuition. So we also talked about systems with one degree of freedom 
in which the force depends only on time. In this VERY special case, we found that we could write 
down the formal solution simply by integration. Then if the integral can be done analytically, we 
get a completely analytic solution. We will go over this again quickly in lecture because we didn’t 
get to talk about it last time. 

Today, we will give some more examples of very special systems in which we can do more 
than just solving numerically. In some sense, I will just be showing you a collection of dirty tricks, 
because it is only in very special cases that they work. But more generally, today’s lecture should 
be regarded as a bunch of examples of the different ways in which initial conditions can enter into 
the solutions of classical mechanics problems. We always need two initial conditions per degree 
of freedom. But they appear in the actual trajectories in many different ways. In fact, I have 
something else in mind as well. At the end, when we come to discuss the harmonic oscillator, we 
will see that there are some very important general principles at work. These will be very useful, 
and we will come back to them many times. 
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Forces of the form F(v) 


Formally, it is almost as easy to solve for the trajectory for a system of one degree of freedom 
in which the force depends only on v as for a force that depends only on t. For a force F(v) 
depending only on v, we write F = ma as 


We can rewrite this as 


dV T?( \ 

It = F(v) 

(1) 

m dv 
- , — dt 

(2) 

F{v) dt 


which we can simplify just by integration. Again we have to remember that the integration vari¬ 
ables must be distinguished from the ranges of integration. Again I will use the notation in Morin’s 
book and just put primes on them. Putting in all the steps very formally, we get 



dt' — t ~ to — 


r l m dv' , 
I to F(V) ~dF dt 


f v d) m 
/ —r dv' 

Jv(to) F(v ) 


(3) 


The last step is a change of variable in the integration from t' to v' = v(i'). A quick and dirty way 
of getting to (3) is to write 

777 

dt = dv (4) 

F(v) 

and integrate both sides. 1 This is a perfectly good way of looking at it. The equation (4) describes 
how a small change in t is related to the small change in v that takes place during the small change 
in t. Integration adds the small changes up. 

Equation (3) implicitly determines v(t) in terms of t — t 0 and the initial condition v(t 0 ). And 
once we know v(t), we can integrate to get the trajectory x(t) (using the initial condition x(t 0 ) as 
usual). 

Let’s see how this works in an important example, describing a frictional force. 


Example: F(v) = —(3v = —mVv 

If an object moves very slowly through a thick gas or a liquid, there is almost always a frictional 
force approximately proportional to the velocity (and in the opposite direction). What “slowly” 
means depends on the size of the object and the viscosity of the stuff it is moving through. We will 
come back later in the lecture to why this force should exist. For now, let’s see how (3) works for 
such force, which we can take to have the form 2 


F(v) = —(3 v = — m T v 


(5) 


(3) becomes 



rv(t) 1 


v(t 0 ) r v' 


dv' 


( 6 ) 


'When we do that, we must turn them into dummy variables, which we do, as usual, by putting primes on the. 

2 The factor of m is just there to make the units of the constant 1 simpler. It makes the ms cancel in (3). Physically, 
the factor doesn’t really make any sense, because the frictional force on an object just depends on its size, shape and 
surface properties — not on its mass. But of course the essential physics doesn’t change when you leave out the m 
and write things in terms of (3 - the formulas just look a bit more complicated. 
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This looks nicer if we put the T on the other side 


r , f vW 1 / 

r / dt = — - dv' 

Jto Jv(to) v 

(7) 

T (t — to) — — lnv(f) + lnn(fo) 

(8) 

lnn(f) = lnv(to) — T (t — t 0 ) 

(9) 

gin v(t) _ gln u(to)— r (t—to) 

(10) 

v(t) =v(t 0 )e- r{t ~ to) 

(11) 


This makes sense. The constant T has units of 1/T, so the exponent is dimensionless, as it should 
be. Thus the units work in (11). We can also check this in the limit T —> 0. In this limit, where 
the frictional force disappears, the velocity goes to a constant, as it should. For nonzero T, the 
frictional force causes the velocity to gradually drop off. Because the force is proportional to v, 
the drop-off gets slower and slower as the velocity gets smaller, in this case, exponentially. A plot 
of the velocity (in units of v 0 ) as a function of time (in units of 1/T) has an exponential shape that 
I hope is familiar: 



r(t-t o) -»• 

Now let’s compute x(t) by integrating (11). 

pt pt 

x(t) = x(to) + / v(t') dt' = x(to ) + / v(t o) e~ L dt' 

Jto Jto 

Doing the integral is easier if we substitute 

u = e~ T du = -T e~ r dt' 


( 12 ) 


(13) 
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we get 


v(tn) r e r( ' t to) 

x(t) = x(t 0 ) -— l du (14) 

which gives 

x{t) = x(t 0 ) + (l - e~ r(t - to) ) (15) 

Note that the object moves a finite distance, n(f 0 )/r, in infinite time. The distance traveled looks 
like this, where for comparison, I have included the linear extrapolation of the initial velocity. 



r (t -1 0 ) 


As usual, it is really really fun to play with this in with Mathematica. Check out lecture-2-l.nb 
included with the lecture notes. 

In Dave Morin’s book, he works out a slightly more interesting example in which there is also 
a constant component of the force. This is relevant to objects falling in a gravitational field and 
also subject to friction. If the force looks like F(v ) = F 0 — mV v, the velocity as a function of 
time is 

v{t) = v(t 0 ) e- r(t ~ to) + ^%(l- e - r{t - to) ) (16) 

m IV / 

One interesting thing about this is that v(oo) = F ° r independent of vq. Whatever the initial condi- 
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tions, v approaches the same “terminal” velocity, as illustrated in the graph below: 



0 1 2 3 4 5 


r(t-to) -> 

If r is very large, this gives a ki nd of Aristotelian physics. When you push on something, it 
moves with a velocity proportional to the force. If T is large enough, you might not notice that 
it takes a little time (1/T) to settle down to this terminal velocity. 

Another example: F(v) = —m P v \v 

If an object moves rapidly through a gas, there is a frictional force proportional to v v \. The funny 
form of the force with absolute value, |u|, is something that you will explore on the problem set, 
so I won’t talk about it now. I will just assume that v > 0 so that v |n| —> v 2 . The force arises 
because the object knocks the gas molecules out of the way, and the force is proportional both to 
momentum change of the knocked molecules (which contributes a factor of —v to the force) and to 
the number of molecules that are knocked per unit time (a factor of |n|). Now let’s begin the same 
analysis for such a force, of the form 



F(v) = —m(3v 2 


(17) 

Now (3) becomes 

ft , f v (t) 1 . 

dt = / 2 d;v 

Jto Jv(to) (3v' z 


(18) 

This looks nicer if we put the (3 on the other side 




ft fv(t) 1 

p dt' = - ( , dv' 

Jto Jv{to) V 


(19) 

or 





1 1 

P {t to) — f.S /, \ 

v(t) v{t 0 ) 


(20) 

or 

1 1 

■+/>(*-*.)= > 

V(t 0 ) v(t 0 ) 

_ 1 + Pv(t 0 ) (t - to) 

(21) 

v(t) v(t 0 ) 

v(to) 
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0 1 2 3 4 5 


fiv(t 0 )(t-t 0 ) -*• 

Compared to an exponential that initially falls at the same rate, this one falls off more slowly later, 
because the the frictional force gets smaller faster as the velocity decreases. 



I won’t work out the details here, but you can now integrate this again to get the position as a 
function of time: 

x(t) = x(t 0 ) + - log^1 + p v(t 0 ) (t - t 0 )) (23) 

Again, we can use lecture-2-l.nb on the Mathematica page to see what this looks like. 

I did another example basically just to show how differently the initial condition (the value 
of v(t 0 ) in this case) comes into to the final result in different cases [compare (22) and (23) with 
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(11) and (15)]. The trajectories always depend on two initial conditions, but what that dependence 
looks like depends on the form of the force. You have to think and keep your wits about you — if 
you try to remember formulas for all the different possible force laws, you will just get confused. 
Instead concentrate on understanding the derivations! But nevertheless the matra — two initial 
conditions per degree of freedom will be a useful check that you are doing sensible things. 

Review of the harmonic oscillator 

Dave Morin talks about forces of the form F{x) in his chapter 3, which are reading. But I am 
going to postpone a general discussion of until after the problem set. The general treatment of 
these forces is essentially just an application of the principle of conservation of energy, so it makes 
sense to wait until we talk about energy. Instead, we will review the simplest example of such a 
force - the harmonic oscillator with 

F{x ) = — K x . (24) 

There is another good reason for thinking about this. In general, if the force depends on both x 
and v (or x, v and t). things are much harder. But by cleverly adding damping and driving to the 
harmonic oscillator, (24), we can include both x and v dependence, and t dependence as well, and 
still say interesting things rather simply. We will talk about this starting next time. 

The harmonic oscillator is a very important mechanical system. We will start today at the 
beginning but will go on to try to explain why it is important. The simplest example of a harmonic 
oscillator is a mass attached to a spring, shown here sitting on a surface, which I will assume is 
frictionless. 


(25) 

There is an equilibrium position in which the spring is neither stretched nor compressed. In 
equilibrium, the spring produces no force on the mass. If the mass moves so that the spring is not 
in equilibrium, the spring exerts a force on the mass that tends to move it back to equilibrium. If 
the mass is released from out of equilibrium, it accelerates towards the equilibrium position. When 
it gets to the equilibrium position, the force goes to zero, but it is moving, so inertia keeps it going 
to the other side of equilibrium. This is the classic recipe for oscillation. 

But so far, this is just oscillation. What makes this a harmonic oscillator is the special force law 
of the ideal “Hooke’s Law” spring, in which the force is proportional to minus the displacement 
from equilibrium. If the block moves in the x direction, and we choose a coordinate system in 
which x = 0 at equilibrium, then the force on the block looks like 

F = —K x (26) 

where K is called the spring constant. The larger the spring constant, the stiffer the spring. 

Now Newton says 

d 2 

F — ma — m -—a; — m x — —A x (27) 

dt 2 
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You probably know that the general solution to the second order differential equation (27) is 


x(t) = a cos cot + b sin ut 


where 


If you don’t know this by heart, that is fine. Next time we will leam how to solve such things in 
general in a very easy way. But meanwhile it is always possible to check that our solution works 
even if we don’t know where it comes from. Differentiating (28) twice gives 

x(t) = —aco sin cot + bu cos ut (30) 

x(t) = —aco 2 cos cot — bu 2 sin cot = —u 2 (a cos ut + b smut) = —u 2 x{t ) (31) 


so that 


mx(t ) = —mu 2 x(t) = — m — x(t) = —K x(t) 

m 


The constant u is called the angular frequency. It is fixed by the physics - the values of the 
mass and the spring constant. But as we expected because we need two initial conditions/degree of 
freedom, there are an infinite number of possible trajectories, labeled by two constants, here a and 
b. Again as expected, we can determine a and b by imposing initial conditions. The details of the 
way the constants enter are different here than in our previous examples, but at least the number of 
initial conditions is right — two constants for one degree of freedom. Setting t = 0 in (28) gives 

x(0) = a cos 0 + b sin 0 = a (33) 

Thus a is the position of the mass at t = 0. Setting t = 0 in (30) gives 

v(0) = x(0) = —acusinO + bu cosO = bu (34) 

Thus bu is the velocity of the mass at t = 0. Thus we can rewrite (28) as 

x(t) = x(0) cos ut H—— sin cut (35) 

u 

As we will discuss in more detail later, we can easily adapt (35) to use the initial conditions at any 
time t —to as follows: 


/ \ \ f , ,1 V\t o) r , . 

x(t) = x(to) COS U[t — to) H- Sin u(t — to) 

-I (i) L 


This comes alive in lecture-2-2.nb on the Mathematica page. 
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It is interesting to collect the various examples we have discussed in one table. 


Force trajectory x(t) 



What I think is interesting about this is that it shows very clearly that while each of these force 
laws requires two initial conditions, which we can take to be the position and velocity at the time 
to, the way in which these two constants appear differs dramatically from one force law to another. 
This is an important lesson. There are always two initial conditions per degree of freedom, the way 
the initial conditions appear in the trajectory depends on the force law. 

There is a very different way of writing the solution (28). Using (I hope familiar) trigonometric 
identities, we can write, with 


t 0 — 0 a = x(0) b = v(0)/u 

(38) 

x(t) = a cos cot + b sin cot = ccos(ut — 0) 

(39) 

= c (cos ut cos 0 + sin cot sin 0) 

(40) 

= c cos 0 cos ut + c sin 0 sin ut 

(41) 

a = c cos0 b = c sin0 

(42) 

c = \/a? + b 2 0 = arctan - 

(43) 


a 


The constant c is called the amplitude of the motion and 0 is the phase of the oscillation. (40)-(43) 
show that the two constants c and 0 carry the same information as the two constants a and b. Both 
pairs are determined by the initial conditions and either pair completely determines the trajectory. 
The connection between these constants and the trajectory is illustrated in (44), a graph of the 
trajectory x(t) versus t. 
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Because the sin and cos functions are periodic with period 27 t, 

x(t + 27T /oj) = ccos [u(t + 2 tt/oj) — (ft\ = ccos(u>t — (f) + 27t) = ccos(ut — (j)) = x(t ) 

and thus the motion repeats after a time 

r = 2tt/u which is called “the period of the oscillation.” 

The frequency (as opposed to the angular frequency) is 

u 1 
2n t 

The product cut — the argument of sin and cos is an angle. 



, . cycles 


, . radians 

Note that 

v has units- 

and 

u has units- 


sec 


sec 


(45) 

(46) 


(47) 


(48) 


These are really the same units in the sense of dimensional analysis because 27r is dimensionless, 
but thinking about the units as stated in (48) will help you remember the factor of 27 t which is just 
the conversion factor between radians and cycles. It will make even more sense soon when we 
make the connection between harmonic oscillation and uniform circular motion. 

So what is it that is special about harmonic oscillation? — two general principles — 


Linearity and Time Translation Invariance 

Harmonic oscillation occurs almost everywhere. There are lots of physical systems whose motion 
is described, at least approximately, by solutions to the same equation as the mass on a spring. 
Why is that? 

d 2 

m —-x + K x = 0 (49) 

at 1 

There are two key features. 

1: Time translation invariance — there is no explicit dependence on time. There are dts — in 
the derivatives with respect to time. But there are no ts. When the equation for motion has this 
property, it follows that if x(t ) is a solution then so is x(t + a). You can easily prove this using 
the chain rule. 

Time translation invariance is particularly useful for the harmonic oscillator because of the 
other special property of its force law. 

2: Linearity — because all the terms are proportional to one power of x or its derivatives, we can 
make new solutions as linear combinations of old ones — if x\ (t) and xAt) are solutions then 
so is A x\ (t) + B x 2 (t). Again, this can be checked explicitly. 

Time translation invariance is physically very reasonable. The laws of physics don’t change 
with time. Or if they do, they do so very slowly (maybe on the scale of the age of the universe, 15 
billion years or so), so that we can’t tell. There are lots of systems in which the coefficients (things 
like mass, spring constant, inductance, capacitance, etc) are determined directly by the laws of 
physics and are therefore independent of time. 

Linearity may seem more like an obscure mathematical concept — what does it have to do 
with physics? Why should there be so many systems that are approximately linear? The word 
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“approximately” here is important. Probably, linearity is never exactly true for a classical system, 
but it is often an excellent approximation. 

So suppose that you are studying oscillations in some system, and all you know about it is that 
it oscillates about a point of equilibrium. Let’s call the quantity that measures the displacement 
from equilibrium x, to make it look like the mass on a spring. We expect that the time evolution 
of the system can be described by the solutions to some second order differential equation (like 
F — m a): 

% X = T{ - X) (50) 

where T is the analog of the F/m in F = m a. Because I’ve assumed that we are in equilibrium 
at x = 0, it must be that 

.F(0) = 0 (51) 

Now, assuming that Fix) is smooth, we can use the most important formula in physics — and you 
can probably guess what that is — the Taylor expansion 

F(x) = F(0) + xF{ 0) + ^x 2 F'(0) + --- = xF(0) + ^ 2 r'(0) + • • • (52) 

Then unless F(Q) is exactly zero, the first term will dominate for sufficiently small x. This is 
why linearity is so important. Most functions in physics are smooth. Most of the time, there is no 
particular reason for JT'(O) to be zero, so it isn’t. Thus the equations of motion for most systems 
are linear for sufficiently small x. 

I should perhaps just note that the most important example of linearity has little to do with 
classical mechanics — it is quantum mechanics itself. Just as classical sound waves or electromag¬ 
netic waves can add together and sometimes interfere constructively and sometimes destructively, 
so also the mysterious quantum matter waves that describe quantum states can be added. As far 
as we know, the linearity of quantum mechanics is exact, not an approximation. At least, very 
sensitive experiments have failed to find any nonlinearities. 

Next time, we will discuss the consequences of time translation invariance and linearity in 
detail. We will see that when the physics of a system obeys these two general principles, the 
trajectories can be written in a very simple form, as sums of exponentials. But sometimes, the 
exponentials will be complex — that is they will involve i = \f—\. Furthermore the generality 
of argument will allow us to extend this result to systems with arbitrary numbers of degrees of 
freedom, with rather dramatic results. 

Back to F(v) = —m V v 

For now, let us now return to the force law that we considered at the beginning of this lecture, 
F{y) = —m T v. What is the physics of this force law? We saw that when friction arises because 
stuff gets knocked out of the way, we get a v 2 dependence on velocity. But this v 2 dependence is 
not linear. When v 2 is sufficiently small, we might expect that this effect will become negligible 
compared to other effects that give a linear dependence on velocity. Indeed, in most liquids, we get 
an approximately linear dependence of the frictional force on velocity for objects that are moving 
slowing enough. Then the molecules of the liquid are not so much knocked as they are gently 
pushed out of the way, so that the process is very smooth and reversible. This is a good excuse 
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to show you one of my very favorite demos, illustrating the smoothness that one gets in a very 
viscous medium, in which the linear regime is easy to reach. 

Appendix - cross products - preview 

The cross product is essentially just an antisymmetric combination of two vectors. This anti¬ 
symmetric combination of two vectors in interesting because it defines a plane, and planes are 
intimately connected with rotations. The particularly convenient thing about this combination in 
three dimensional space is that it behaves like another vector. The cross product is the mathe¬ 
matical statement of the fact the antisymmetric combination of two vectors in three dimensional 
space defines a plane which in turn defines another vector. The geometrical definition of the cross 
product is a good way to see that it behaves like a vector under rotations, so we will start with that. 
Then I will indicate how we can show that this geometrical definition is equivalent to a definition 
given in terms of components. 

The geometrical definition is this: 

Given two vectors, A and B, the object A x B is a vector with magnitude (53) 
A B sin 6 where 6 is the angle between A and B defined as a positive angle 

between 0 and n. The direction of A x B is perpendicular to the plane formed by A 
and B with the sign determined by the right-hand rule. 

With this definition, it is easy to understand why Ax B behaves like a vector under rotations. The 
magnitude doesn’t change under a rotation because A , B and sin 6 are all unchanged. And the 

direction rotates properly because it is tied to the directions of A and B. 

It is crucial that the cross product Ax B is antisymmetric in the two vectors A and B, 

Ax B = -B x A (54) 

In the geometrical definition, this follows from the application of the right hand rule. If you 
interchange A and B, the cross product changes direction because the right hand rule goes from 
B to A rather than from A to B. This antisymmetry ensures that either the two vectors A and B 
define a plane or the antisymmetric combination vanishes. Then the fact that in three dimensional 
space, there is a unique direction perpendicular to a given plane allows us to turn the antisymmetric 
combination into a vector. 

The geometrical definition, (53), is equivalent to the following component definition, 

AxB I = A y B z - A z B y , \AxB] =A z B x -A x B z , \AxB] = A x B y - A y B x , (55) 

l A x a l Ay l A z 

where we are using a notation for vector components in which 

A = A x , A = A y , A = A z . (56) 

ax l Ay L A z 

If you have not seen cross products before in your math courses, you can look at the demonstration 
of this equivalence below. We will be using cross products a lot later on in the course when we talk 
about rotations in three dimensions. 
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To prove that (53) and (55) are equivalent, we first show that 


(Ay §) ■ A= (Ay §) ■ B = 0 

We can do this by explicit calculation. For example, 


( 57 ) 


(A x B) - A — ( A y B z — A z By)A x + ( A Z B X — A x B z )A y + ( A x B y — A y B x )A z — 0 


(58) 


The calculation for B is similar (as it must be because of the antisymmetry of the cross product 
- (58) just says that the dot product of the cross product with the first vector in the cross product 
vanishes - and because of antisymmetry the same must be true for the second vector in the cross 
product). Thus A x B is perpendicular to both A and B and therefore perpendicular to the plane 
they form, just as in the geometrical definition. 

You can see that the magnitude of the object given by (55) is right by explicitly calculating its 
square. 

(A x ■ (A x B^j = ( A y B z — A z B y ) 2 + (. A Z B X — A X B Z ) 2 + ( A x B y — A y B x ) 2 (59) 

= A X B 2 + A 2 x B: + A 2 B X + AyB 2 + A 2 B 2 + A 2 B 2 — 2A X B X A y B y — 2 A X B X A Z B Z — 2A y B y A Z B Z 

m 

If we add and subtract A 2 X B 2 X + A 2 B 2 + A 2 Z B 2 to this, the positive terms can be factored into 

[Al + Al + A^Bl + Bl + Bf) ( 61 ) 

and the negative terms into 

— (A x B x + A y B y + A z B?j (62) 

so we can write (59) as 


(AyB) 

■(AyB) = 

A ' 

) 

' B 

2 / -> ^\2 
-(A-B) = 

A ^ 

) 

' B 


(1 — cos 2 6) = 


A 


21 —» 12 „ 

B 


sin 9 (63) 


Finally, you can see the right-hand rule by calculating an example, like x x y = z. Thus we have 
checked that the component definition (55) is equivalent to the geometrical definition (53). 
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lecture 3 

Topics: 

Where are we? 

Consequences of Time Translation Invariance and Linearity 

F = —rriTv again 

The harmonic oscillator again 

The damped harmonic oscillator 

Uniform circular motion in the complex plane 

Harmonic oscillation for more degrees of freedom 

The double pendulum 


Where are we? 


Last time, we saw how initial conditions appeared in a number of different examples of force laws. 
The last of these, the harmonic oscillator, is a particularly important system because it has two 
general properties, time translation invariance and linearity, that appear in many many physical 
systems. 

Because linearity went by pretty quickly last time, let me briefly review how it works. The 
equation of motion, F = ma, for the harmonic oscillator is linear because there is a single x in 
each term. It can be written as 

ci^x 

m + K x = 0 (1) 

at 2 

We can think of this as a single “operator” acting on x. 


f d 2 \ 

m + K x = 0 
, dt 2 ) 


( 2 ) 


In this form, it may be more clear why you can add solutions together to get new solutions. If you 
have two solutions, x\(t) and x 2 (t), you can form an arbitrary linear combination of the two and 
the result will be another solution. 


( m + K ) Xl(<) 
( m ^ + K ) 


0 

0 


( 3 ) 



(axi(t) + bx 2 (t)j = 0 


1 



because 


= m 


m^ + K^j (axi(t)+ bx 2 (tf) 
+ k'J (axi(tf) 

‘' lt > + A j 


“ ( m Ip + K ) Xl(t) 

i { m ^ + K ) X2(t) 


( 4 ) 


= 0 


This fact has remarkable consequences, as we will see shortly. 


Consequences of Time Translation Invariance and Linearity 

Time translation invariance is an example of a symmetry. The physics of the harmonic oscillator 
looks the same if all clocks are reset by the same amount. When a symmetry is combined with 
the property of linearity, the result is an extremely powerful tool for studying the solutions of the 
system’s equation of motion. The reason is that because of linearity, the solutions of the equation 
of motion form what mathematicians call a linear space. You can add them together and multiply 
them by constants and you still have solutions. Because of this, we can use the tools of linear 
algebra to understand them. You should all be used to some of these tools even if you have not 
studied much linear algebra, because you all familiar with one linear space - the space of vectors 
in three dimensions. One of the really important things about linear spaces is that you are free to 
choose any convenient basis - like x, y and 5 in three dimensions. 

In the linear space of solutions of the harmonic oscillator, we can choose a convenient set of 
basis solutions that behave as simply as possible under time translations. For the symmetry of time 
translation, it is a mathematical fact that the simplest basis solutions are just exponentials. We can 
always find solutions of the form 1 

z{t) = z{ 0)e m (5) 

where H is some constant. What is special about this form (and I am not going to discuss this in 
detail -1 hope that you will see this beautiful argument in more detail in Physics 15c) is that when 
you change the setting of your clock by taking t —> t + a, the exponential (5) is the only function 
that just changes by a multiplicative constant, 

z(t + a) = 2 ( 0 ) e H{t+a) = z(0) e Ht e Ha = e Ha z(t ) (6) 

You can always use the linearity of the space of solutions to find particularly convenient solutions 
that behave in this simple way under time translations - and then the result has to be an exponential. 

'This is essentially equivalent to the statement that a linear differential equation with constant coefficients always 
has an exponential solution — see Morin’s text. 
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Once you realize that the solutions are just exponentials, you can find the possible values of H 
in a very simple way. If you do take (5) and put it into the equation of motion, time derivatives 
acting on e Ht just bring down factors of H. This converts the differential equation into an algebraic 
equation. 



(7) 


Then once we have the basis solutions, we can form linear combinations to satisfy the initial 
conditions, in the same way that we can write a general vector in three dimensional space as a sum 
of coordinates times basis vectors 

r — xx + yy + zz 


This simple dependence of f on the coordinates x, y and z is the power of linearity at work. We will 
see that it works in a similarly simple way for the initial conditions in a linear and time translation 
invariant mechanical system. 


F = —mTv again 

Let’s do this for the linear frictional force we talked about last time, of the form 


—m T v 


( 8 ) 


with 


T > 0 


(9) 


We know how to solve this directly by integrating, as we did last time, but you will notice, I hope 
that this force law is also time translation invariant and linear. There is just a single factor of x in 


v = 


dx 

dt 


( 10 ) 


and t enters only through derivatives. Therefore, we expect that the solution is a sum of exponen¬ 
tials in time multiplied by constant coefficients that depend on the initial conditions. The equation 
of motion has the form 

rn x — —rn Ti or m|i + Tij=0 (11) 

This allows us to solve the problem in a different way, by assuming that the solution is a linear 
combination of exponentials. If we put in a trial solution of the form 


x(t) = Ae Ht 


( 12 ) 


each dot becomes a factor of H and we get 

m[H 2 Ae Ht + T H Ae Ht ) =0 = m Ae Ht ( h 2 + Y H) =0 (13) 

The factor mAe Ht never vanishes except for the uninteresting case A = 0, so we can just cancel 
it and conclude that 


H 2 + TH = (H + T)H = 0^H = -TorH = 0 (14) 


3 



so there are two simple solutions, 

x{t) = Ae° = A and x(t) = Ae~ rt (15) 

The general solution, because of linearity, is then a combination of these two solution with coeffi¬ 
cients that depend on the initial conditions: 

x(t) = A + Be~ rt (16) 


Comparing with the solution that we got by direct integration, 


x{t) = x(t 0 ) 

, v (to) , 

r 


(17) 

we see that it is equivalent to (16) with 




A = 

= x(t 0 ) + 

v(t 0 ) 

r 

(18) 

and 

v(to) 

r 



B 

e rt ° 

(19) 


I hope you agree that this calculation was quite a bit easier and quicker than integrating. Using 
these general principles of time translation invariance and linearity is not only cool, it saves you 
work. As you will see in more complicated examples in a few minutes, it often saves you a LOT 
of work! 

The harmonic oscillator again 

Before going on to the harmonic oscillator, let’s describe this process in the form of two simple 
steps for dealing with systems with time translation invariance and linearity: 2 

1. Put a trial solution of the form A e Ht into your equation of motion. The derivatives become 
powers of H and you can find the values of H — hj that work by solving an algebraic 
equation. This gives you your basis solutions A e hjt 

2. Write a general solution by making a general linear combination of your basis solutions with 
arbitrary values of the As — 

x{t) = Y J A je hjt (20) 

3 

and find the constants Aj by imposing initial conditions, as usual. 

Now let’s apply these two steps for the mass on the spring. Step 1 is straightforward. 

mz(t) — —K z(t) (21) 

z(t) —> Ae m (22) 

2 The only time this procedure doesn’t work is when the algebraic equation you get has degenerate roots. We will 
see what to do in this special case later. 
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m^Ae m = —K A e Ht 
dt 2 


or m^Ae m + KAe m = Q 
dt 2 


mH 2 Ae m + K Ae m = 0 
(m H 2 + A') Ae m = 0 
m 77 2 + K = 0 = m(i7 + icu)(H — io;) 

[~K 

H = ±itu for i = '/—l and u = \ / — 

V M 


So our two basis solutions are 


Aujt 


and 


—iut 


(23) 

(24) 

(25) 

(26) 

(27) 

(28) 


The calculation in step 1 was very simple. The only curious thing here is that we are led to complex 
numbers. While time translation invariance tells us that there are solutions of the form A e Ht , it 
doesn’t tell us that H is real, and for oscillations, it isn’t. 

Now in step 2, we form the general solution by forming a general linear combination of our 
basis solutions. Thus the most general solution for the harmonic oscillator looks like this: 


x(t) = ce iuJt + de~ iut 


(29) 


Now this is a little peculiar. Unlike the situation with the frictional force, in (17), this doesn’t look 

the same as the cosine and sine that we got by solving the differential equation. But in fact, it is the 

same. The connection with sines and cosines is Euler’s formula, one of the more amusing relations 
in mathematics: 

e ie = cos 9 + i sin 6 (30) 

There are many ways of seeing this. Let’s just use the most important formula in physics again — 
Taylor’s expansion 

e" = 1 + (t#) + \(i0? + W 3 + i(») 4 + ■ ■ ■ (31) 

= (i - \<f 2 + 7(») 4 + •■■)+! (e - +...) (32) 

Because of Euler’s formula, you see that the general solution in terms of cosine and sine is 
completely equivalent to the general solution in terms of complex exponentials. 


x(t) = a cos cut + b sin u>t 

is equivalent to 

x(t) = ce iu}t + de~ iuJt (33) 

= c (cos out + i sin ut) + d (cos cut — i sin cut) 
a = c + d b = i(c — d) 

Notice again how linearity is at work here. Linearity is what guarantees that a linear combination 
of two possible trajectories is another possible trajectory. This is what allows us to write the most 
general solution as a combination of the two complex exponential solutions times constants: 

x(t) = ce iuJt + de~ iu}t (34) 
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It is the fact that the initial conditions appear in this extremely simple way as the coefficients of 
simple basis solutions that makes all of this work. 

If you haven’t seen this before, and perhaps even if you have, this probably looks really strange. 
And you might also be asking yourself, if this is equivalent to the familiar solution in terms of 
cosine and sine, what is the advantage of using these unfamiliar complex exponential. The right 
answer, I think, is that once you get used to using complex exponentials, they will simplify your 
life enormously. We will see this in a few minutes when we discuss damped oscillators, but the 
message is really more general. We don’t have to use complex exponentials. We can do everything 
using cosines and sines, using a combination of trigonometry and algebra. But with complex 
exponentials, all we need is algebra! 

In fact, Euler’s formula is the connection between algebra and trigonometry! You can define 
the trigonometric functions this way: 


piO | p iO 

cos 6= 2 


(35) 


£ 3 ^ _ £3 

sin 9 =- (36) 

2 1 

Now you can derive all trigonometric identities just using algebra, and you never have to do 
trigonometry again. 


Uniform circular motion in the complex plane 

One very evocative way to think about these complex solutions is in what is called “the complex 
plane.” Because a complex number has two real components, its real and its imaginary part, we 
can think of a complex number as a real vector in a two dimensional space in which the real part 
is the x component of a two dimensional vector and the imaginary part is the y component. This 
two dimensional space is the complex plane. Euler’s formula, (30), tells us that the basis solution 
e luJt has real part coscut and imaginary part sin cut, so its counterpart in the complex plane is the 
two dimensional vector, (coscut, sin cut), 

e ZUJt = cos cut + i sin cut —> (cos cut, sin cut) (37) 


But this is a unit vector an angle cut from the x axis. Thus as t increases, e Zujt executes uniform 
circular motion in the complex plane. You can see this in the Mathematica file lecture-3-l.nb. 

More generally, a complex number z = x + iy can be written equivalently as a positive number 
R times a complex exponential e ld . Note the connection of this with the relation between Cartesian 
and Polar coordinates in the complex plane. 


R& * (3?) y) Cartesian (-R, $) Polar 

(38) 

R = \z \ — \Jx 2 + y 2 

(39) 

9 = arg (z) 

(40) 

arctan (y/x) for x > 0 , 

arctan (y/x) + n for x < 0 . 

(41) 
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This connection is used in many practical applications to convert from circular motion to back 
and forth “reciprocating” motion, or vice versa. 3 


The damped harmonic oscillator 

We have now seen what happens when we have friction with a spring or a spring without friction. 
With a frictional term, the equation of motion for the mass on a spring becomes 4 

d 2 d 

m-— x(t) = —mV — x(t) — K x(t) (42) 

dt 2 w dt w w 

tt 2 X ® + V Jt X ^ + X<yt ' ) = ° ( 43 ) 

where cuo = \JK/m. This interpolates between the two systems we have just analyzed. As T —> 0, 
we recover the harmonic oscillator. As K —> 0, we get simple linear friction. But the general form 
still satisfies the conditions of time translation invariance and linearity. Therefore we still expect 
the solutions to be an exponentials and we can still use our two steps to construct the general 
solution. 

Step 1 is the usual. Because of time translation invariance and linearity, we can look for expo¬ 
nential basis solutions: 

z(t) = Ae m (44) 

f-Ae Ht + T^Ae Ht + u 2 0 Ae Ht = 0 (45) 

dt 2 dt 

As usual, derivatives with respect to t just bring down factors of II. so we can convert this to an 
algebraic equation: 

( H 2 + TH + uj 2 0 ) A e Ht = 0 (46) 

Notice that everything has gone exactly the same way here as in our two previous example. Indeed 
the only difference here is that the algebraic equation, (46), is a little more complicated that the 
previous examples. We need to use the quadratic formula to find the two solutions: 



where 

T > 0 ^ Re H < 0 Re H± < 0 
z(t ) oscillates if cug > T 2 /4 and it just dies out if c o 2 < T 2 /4 — 

| with t , -if if r / 2>UJ o ' 

z(t) oc e H±t = 7^ x , e Wry4-„g 

o if r/2<0J 0 

'i.e. http://www.rpi.edu/dept/chem-eng/Biotech-Environ/PUMPS/reciprocating.html 
4 The little 7 that appears in the discussion in Dave’s book is just F/2. 


(47) 

(48) 


( 49 ) 
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Step 2 is also the usual one. The general solution is a linear combination of the simple expo¬ 
nential basis solutions: 

x(t) = b + e H+t + 6_ (50) 

where H± are defined in (47). The constants b± contain the information about the initial conditions, 
just as in the undamped harmonic oscillator. You can see this in the Mathematica file lecture-3- 
2.nb. 

The Mathematica file starts by finding x(t) as a function of u 0 , T, and the initial conditions, 
x(0) and n(0). Then it graphs x(t) with all of these as knobs, starting with u 0 = 0 — that is 
pure damping with no spring. The mass is then moving back towards the origin (which is quite 
arbitrary until we put in the spring). The mass just slows down and stops somewhere (that depends 
on the initial conditions of course). Now look at what happens if you gradually increase the spring 
constant, and therefore increase ojq with T and the initial conditions held fixed. The graph also 
shows the values of the H± that appear in the exponent of (50) and of the constants b±. 

Now as we turn on a small u 0 term by putting in a weak spring, you might think that each of 
these components would start to oscillate. But that is not what happens. And it cannot possibly 
work that way because oscillating terms necessarily come in pairs - proportional to sin cut and 
cos cot or e ±lut and there cannot be two such pairs with different exponential dependence or we 
would need more than two initial conditions to determine the trajectory. Instead, the small spring 
term produces two terms with exponential decay and the decay constants get closer together as 
cco increases. You can see that this must be so by reminding yourself about where the quadratic 
formula comes from. The two exponentials, e H±t , are determined by (46) as follows: 

inertial frictional spring 

H 2 + HY + c o 2 0 = 0 = {H - H+){H ~.HJ) (51) 

H + + 77_ = -T H+H- = ujj (52) 

Evidently, the spring term determines the product of the two Ts while the frictional term determines 
the sum. The product starts at zero for u 0 = 0 Not until the product of the H s get larger than a 
quarter the square of the sum does real oscillation begin when the Ts develop an imaginary part. At 
this point, the coefficients b± also become complex. While the mathematics of this is unambiguous, 
and it is very clear in the Mathematica file, I would like to be able describe this more physically. 

At any rate, it is clear that the nature of the trajectories described by (50) depends on the 
relative size of the two parameters, T and cu 0 . If T > 2cu 0 , the damping is large (this is called 
“overdamped”). In this case, both H + and H_ are real and negative, and the trajectory is a sum of 
decaying exponentials. 

If T < 2u,'o, the damping is small (this is called “underdamped”). In this case, both // and //_ 
have a negative real part and an imaginary part (with opposites signs). In this case, the trajectory 
oscillates (or circles in the complex plane), but also dies out with time exponentially in t. You can 
see this in the Mathematica file lecture-3-3.nb. 

At the boundary between these two cases — if T = 2u 0 , the system is “critically damped.” In 
this case, the general solution is 


x(t) = (A + Bt) e~ rt/2 


(53) 
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Harmonic oscillation for more degrees of freedom 

Because our analysis of the harmonic oscillator is very general, relying only on the general princi¬ 
ples of linearity and time translation symmetry, the result of (5) 

z(t) = z(0)e m (5) 

applies to any system satisfying these two principles. For example, it is not necessary to restrict 
ourselves to a single degree of freedom. With more degrees of freedom, z(t) becomes a vector 
with number of components equal to the number of degrees of freedom, as does z(0) in (5). Thus 
(5) implies that there are special solutions in which all the components of z move in lockstep, with 
the same angular frequency. Such a motion is called a “normal mode”. The same two steps suffice 
to solve these more complicated problems, but now there are more basis solutions (because there 
are more degrees of freedom) and each of the basis solutions describes a motion of ALL the parts 
of the system. We are not going to explore this in detail in this course. It is the starting point for the 
study of wave phenomena in Physics 15c. But it is such a useful way of talking about the motion 
of so many classical systems that I will spend a little time on one important example. 

The double pendulum 

Here is a very simple example of normal modes that I hope will make the idea clear. Consider the 
double pendulum, which looks like this: 



(54) 


Two identical pendulums are constrained to move in the plane of the paper and coupled together 
by a massless spring with spring constant K. In this case the configuration can be labeled by two 
numbers, x\ and a; 2 the displacements of block 1 and block 2 from equilibrium. Thus this is a 
system with two degrees of freedom. The vector q(t) that describes the configuration is just 



(55) 


Step 1 is now more complicated because we have to find the normal modes. Without the spring, 
the two pendulums would oscillate independently. For small oscillation, the oscillation of a single 
pendulum is harmonic with angular frequency u> = \j~gft. The spring couples these motions 
together. However, the normal modes are still harmonic. There are two basis solutions for each 
normal mode. 
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The normal modes look like this: 



There is one normal mode in which the blocks move together. In this mode, the spring in the 
middle is never stretched from its equilibrium length. The angular frequency of this mode is just 
the same as the angular frequency of a single pendulum, which (for small oscillations for which 
the system is linear) is U\ = \JgJi where l is the distance from the pivot to the mass. The basis 
solutions that describe this normal mode are 

z(t) = ( j ) eiuit and z(t) = ( } ) e~ iu}lt (57) 



There is another normal mode in which the two pendulums move in precisely opposite directions. 
The frequency of this mode is slightly higher than Jg/t, because now the spring contributes to 


the restoring force that produces the oscillation. In fact, in this case, u 2 = y 2iT + g/L The basis 
solutions that describe this normal mode are 



The thing about a normal mode is that the ratios of the displacements of all the parts of the 
system are fixed throughout the motion. In the double pendulum, in the motion (56), xi(t)/x 2 (t) = 
1 throughout the motion, while in (58), x\(t)/x 2 (t) = —1. 

You should have read a bit more about these in Morin’s book. In general, it is not easy to find 
normal modes. In a case like this, you can guess them just from the symmetry of the system. The 


10 





details of finding them in general involves heavy-duty linear algebra, and this can wait for Physics 
15c. What I care about in this course is that you know what these things mean and how to use them 
if someone tells you what they are. 

The point is that step 2 is the same as before. The general solution is always obtained by taking 
a general linear combination of the basis solutions: 

( lift) ) = Ci (1) ^ + Di ( } ) e~ iUlt + C 2 ( ) e^ + D 2 ( ) e~ iuJ2t (60) 

This is short-hand notation for two equations, 


Xl (t) = C x e iuit + D 1 e~ iiJlt + C 2 e iuJ2t + D 2 e _<W2t 
x 2 (t) = C x e iu}lt + D\ e~ iuJlt - C 2 e iuJ2t - D 2 e~^ 2t 


(61) 


As usual, we can use Euler’s equation to rewrite this in terms of sines and cosines. 

( iMf) ) = ( \ ) COSUJ it + ^ | j sin u\t + A 2 ^ cosou 2 t + B 2 ^ sin u 2 t (62) 

Again, this is short-hand notation for two equations, 


X\ (t) = Ax coscui t + Bx sin ui\t + A 2 cos u 2 t + B 2 sin uj 2 t 
x 2 (t) = Ax coscuif + Bx sinu;if — A 2 cos u> 2 t — B 2 sinu^f 


(63) 
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lecture 4 

Topics: 

Where are we? 

Conservation of Energy 
Work and Energy and the second Law 
Energy in the harmonic oscillator 
More degrees of freedom 
Forced oscillation and resonance 
Harmonic driving forces 
Energy in the driven oscillator 
Breaking the wine glass 

Where are we? 

We have now seen a number of examples of the use of F — ma. You should all be familiar at this 
point with the techniques of determining the trajectory of a classical system that is picked out by a 
given set of initial conditions. 

We also discussed the beautiful and surprisingly general behavior of the harmonic oscillator, 
and the particularly simple description of its motion that obtains when we allow our trajectories to 
involve i = \f~l. 

Now, we will start the process of going beyond F = ma to more general and powerful ap¬ 
proaches. This will occupy us for the next two weeks. This week, we will discuss one of the great 
conservation laws of classical mechanics — conservation of energy. 

I also have some organizational remarks. First, a bit of good news — we have now seen all 
the ways of solving differential equations that we are going to use. There are really only two of 
them. One is integrating, perhaps after moving things around a little bit to separate variables on 
two sides of the equation before integrating. The other is hoping that the solution is an exponential, 
plugging in and checking to see whether it works! Nothing more complicated is going to happen! 
We will, however, do a little more multivariable calculus today. Please please please stop me and 
ask questions if you see something you don’t understand. 

Now to work (literally and figuratively)! 

Conservation of energy 

It is somewhat unfortunate that “energy conservation” has come to mean two very different things. 
When we read in a newspaper about energy conservation, the article is usually about using energy 
carefully and not wasting it. What we mean in classical mechanics when we say energy conserva¬ 
tion is something very different. We mean that there is a quantity called “energy” that is unchanged 
with time for any possible trajectory of the classical system. And in fact this physical meaning of 
energy conservation seems to be a basic law that survives beyond classical mechanics into the 
quantum realm. I suppose that there is some connection between these two meanings. If energy 
were not conserved in the physics sense, if we could simply make new energy whenever we need 
it, then perhaps we would not have to be so worried about using it sparingly and efficiently. It is 
the fact that energy is conserved in the physics sense that makes it such an important quantity. 
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Today and next Tuesday, I will give some examples of the use of conservation of energy. In 
the weeks that follow, we will address the general issue of conservations laws more systematically 
and see where they come from at a deeper level. But before we can do that, we will have to leam 
more about why classical mechanics looks the way it does. This will require that we reformulate 
classical mechanics in a very beautiful way. And paradoxically, it will involve quantum mechanics 
as well. 


Work and Energy and the second Law 


Let’s begin by considering a single degree of freedom and a trajectory x(t) satisfying F = ma for 
some force law. We will often simplify the formulas by not writing the the (t) in x(t) explicitly, 
but it is important to note that we are considering not a random function, but a trajectory, and if we 
were to write things out in gory detail, F = ma would be the second order differential equation 
that x(t) satisfies: 

F(x(t), x(t), t) = mx(t) (1) 


Now we write F = ma as 


Multiplying both sides by v gives 



( 2 ) 


dv 


v m 


dt 


d 

dt 



F v = P 


( 3 ) 


We will define the kinetic energy of a particle of mass m to be v 2 , thus the left hand side of (3) 
is the rate of change of kinetic energy. The right hand side is another important quantity called the 
power, P, supplied by the force to the particle. Thus (3) is the differential form of the work-energy 
relation. The power supplied by the force acting on a particle is the rate of change of the particle’s 
kinetic energy. This is always true whether there is a conserved energy or not — it follows for any 
possible trajectory of the system, just from F = ma. Integrating (3) with respect to time gives the 
classic work-energy relation, that the change in kinetic energy equals the work done by the force 
on the particle: 


1 2 
-m v 9 
2 2 


2 

2 mv i = 


r * 2 d 


hi dt \2 


— -m v dt = 


rt2 


P dt = 


rt2 


v F dt = 


F dx 


it i 


( 4 ) 


IX i 


The left hand side of (3) is a total time derivative. If the product F v is also a total time derivative, 
then we can find a quantity that doesn’t change with time — its time derivative is zero. To see this, 
call 

d 

Fv = --U (5) 

dt 


then (3) becomes 


d 


dt \ 2 


d 


dt \2 


0 = — ( -mv ) — F v = — ( -mv ] + — U — — ( -mv + U 

dt 


d 


dt \ 2 


Then quantity in parenthesis doesn’t change with time and we give it a name, energy, 


( 6 ) 


E = -mv 2 + U 


( 7 ) 
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and we say that it is “conserved” because it doesn’t change with time. 

A simple way in which (5) can happen is for F to be a function only of x. Then we can take U 
in (5) to be 

U(x) = - r F( x ') dx' (8) 

•Ixo 

Physically, this makes sense because the integral in (8) keeps track of the work done by the force 
in moving from xq to x. If this is negative, then to make the system move in this way, we would 
have to do work against the force, and would get that energy back by letting the system move back 
to xq. Mathematically this works because we can use the chain rule to write 


d 

dt 


U(x) 


dx d 

— — U(x 
dt dx 


dx d 
dt d x 



F(x') dx' 


—v F(x) 


(9) 


So that v F is a total time derivative, 


vF(x) = ~U(x) (10) 

dt 

In this case, as I’m sure you know, U(x) is called the potential energy, and the force is just minus 
the derivative of U, 

F(x) = —^U(x) (11) 

dx 

Note that it is crucial that U depends only on x. Mathematically, you can see that Fv = 
—dU/dt doesn’t work if U is a function of v or t. If U depends on v (like a frictional force), time 
derivatives of U give terms that depend on the acceleration a - which the force is not allowed to 
do. If U doesn’t depend on v but does depend on t the derivative with respect to t has pieces with 
no factor of v. You need U to have only x dependence to always get the factor of v from the chain 
rule in (9). 


Energy in the harmonic oscillator 

For the harmonic oscillator, the potential energy is 

U(x) = - I" Fix') dx’ = - f X i-Kx’) dx’ = \k x 2 - \kx\ (12) 

We can take the starting position x 0 to have any fixed value, so we might as well start at the 
equilibrium position, x 0 = 0 so the second term goes away. The second term in (12) doesn’t do 
anything anyway, because it is a constant independent of x so it just drops out when we differentiate 
to get the force. Thus we can drop the second term and take the conserved energy in the harmonic 
oscillator to be 

E = -mi 2 + K x 2 (13) 

2 2 

It is instructive to see how this works for a general solution of the form 

x(t) = ccos(u;t — 0) (14) 

Putting (14) into (13), we get the following for the energy: 

E = ^ mu 2 c 2 sin 2 (cut — 0) + ^K c 2 cos 2 (c ot — 0) (15) 
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Since u = \JK/m, this is 

E = c 2 sin 2 (cut — </>) + ]^K c 2 cos 2 (cut — 0) = c 2 (16) 

which is a constant, as we expected. But notice the way the mathematics manages to encode the 
sloshing back and forth of energy between kinetic and potential. 

More degrees of freedom 

Consider a system with two degrees of freedom with the configuration specified by the two co¬ 
ordinates x\ and x 2 with masses mi and m 2 respectively. Denote the forces on masses 1 and 2 
by F\ and F 2 . We will begin by considering the situation in which the forces depend only on the 
coordinates, not on their derivatives, so that that Newton’s second law is 


miXi = Fi(xi,x 2 ) m 2 x 2 = F 2 (xi,x 2 ) 


(17) 


With one degree of freedom, we saw that there is automatically a conserved energy if the force 
depends only on position. Here we will see that the situation is more complicated. To see what the 
issues are, let’s compute the time derivative of the kinetic energy, 

^ Qmii; 2 + ^m 2 x = miiiXi + m 2 x 2 x 2 = xiFi(xi,x 2 ) + x 2 F 2 (xi, x 2 ) = P (18) 


P is the power fed into the system by the force. If the right hand side of (18) is the time derivative 
of something, 


x 1 F 1 (x 1 , x 2 ) + x 2 F 2 (x 1 , x 2 ) 


? 



(19) 


then we can define an energy that doesn’t change with time 


E = -miij + -m 2 x \ + U 


( 20 ) 


To see when there is a U satisfying (19), we need an important result from multivariable cal¬ 
culus. The change in a function of several variable can be written as a sum of the changes in the 
variables times the corresponding partial derivatives: 


,,, , , df df df 

df(a, b, c, • • •) = da — + db — + dc — + 
da ob oc 


( 21 ) 


The reason is not too hard to understand. To find the change in / due to a change in a with the 
other variables held fixed, you would multiply the change in a by the derivative of / with respect 
to a with the other variables held fixed (which is what the partial derivative means) to get the first 
term in (21). Similarly, to find the change in / due to a change in b with the other variables held 
fixed, you would multiply the change in b by the derivative of / with respect to b with the other 
variables held fixed to get the second term in (21). And so on! Then the total change in / is just 
the sum of all of these independent changes. When you say it in words rather than writing it down 
mathematically it makes perfect sense. If you want to know how something changes, you have to 
add up all the possible sources of change! 


4 



We can divide (21) by dt to get the total time derivative, 

d f( ; \ • d f , j d f , • d f | 

-f(a,b,c,---)=a Ta+ b- + c Tc+ ... ( 22 ) 

Here is an example of (22). Consider the volume of a cylinder with height h and radius r. 



if both r and h are changing in time 


dV . dV ■ dV 

-dF~ r dF + h dh 


r 2 tt r h + 



change in r change in h 


(23) 


(24) 


Now applying (22) to (19), we see that U had better not depend on derivatives of x\ and x 2 . If 
it did, (22) would give terms involving x\ and x 2 , which we don’t want. And it should not depend 
on t, because that would produce a term without an x\ or x 2 because t — 1. Thus we want 


„ . , „ . . d TT , . dU dU 

x 1 Fi(x l ,x 2 ) + x 2 F 2 (x i,x 2 ) = ~—U(xi,x 2 ) = -xx — - x 2 


which works if 


Fi (xi,x 2 ) = 


dU 


and F 2 {x i,x 2 ) — 


dU 


(25) 


(26) 


d Xl v y dx 2 

There is a conserved energy if we can get the force by taking partial derivatives of a single function. 
The function U is the “potential energy.” 1 When the forces have the form (26), you can show that 
the work done by the forces in a motion of the system from one configuration to another is the 
difference between the initial potential energy and the final potential energy and therefore does not 
depend on how the system gets from the initial configuration to the final configuration. 

Let’s do an example of a collection of springs and two masses sliding on a straight frictionless 
track, as shown in (27). 

| Xi —> | x 2 —> 

K\ , _, K 2 r _, K, 




i 


2 


misism 


(27) 

We can take the coordinates x\ and x 2 to be the (signed) distance along the track from the equilib¬ 
rium positions of the two masses. The forces are 2 


Fi(xi, x 2 ) = —Kixi - K 2 (x i - x 2 ) and F 2 (x i, x 2 ) = —K 2 {x 2 - xi) - K 3 x 2 (28) 

'There could also be contributions to the force that depend on ii and x 2 , but which cancel in the combination 
XiFi(x\, x 2 ) + x 2 F2(xi,X2). This is the way the magnetic force works. 

2 As always, the direction of “positive” force is the direction in which the coordinate is increasing. 
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As we might guess from our previous discussion of a single spring, if we choose 


K\ 2 , I < 2 


,2 K , 


U(x 1 ,x 2 ) = — X 1 + — (xi -x 2 ) + — 

then (26) is satisfied. So (29) is the potential energy and the total energy, 

X X 

E = -m\x\ + -m 2 x \ + U(x i,x 2 ) 


( 29 ) 


(30) 


is independent of time for any trajectory. 

Nothing much changes if the number of degrees of freedom is greater than two, we just have 
to keep track of more variables. For n degrees of freedom, if we call the coordinates x 3 for j — 1 
to n, the kinetic energy is 


m 




The rate of change of (31) is 

d 1 n 


dt 


E Tflj . o 

—x~ 
9 3 

3 =1 Z 


3=1 


= J2 rrijXjXj = Xj F j{x u 

3 = 1 i =1 


,Xr, 


and if the force on the jth mass is 

Fj(x i, 

then the total energy 


d 


,x n = 


dx- 


U(x i, • • • ,x n ) 


Y , 1V ^ x2 3 +U{x U ---,X n ) 


(31) 


(32) 


(33) 


(34) 


\3= 1 


is conserved — that is independent of time for any allowed trajectory, 


xi (f), • • • ,x n (t) 


(35) 


Forced oscillation and resonance 

Suppose we “drive” our damped harmonic oscillator by adding a time dependent force, so the 
equation of motion becomes 


(S5 + r J t + “'») (36) 

Let’s begin by discussing the linearity of this equation of motion. Because of the force term, 
the situation is a bit different from that of an unforced oscillator. Suppose that I have a solution to 
this equation, x\(t) and another one x 2 (t). 

Now if I add them together, I don’t get a solution to the same differential equation 
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Xi (t) = Fi(i)/m 


1 dt 2 

I d 2 

\ d:t 2 


r 4 

dt 

dt 



x 2 (t) = F 2 (t)/m 


(^ + r S + "'“) + 

= [aFi(t) + bF 2 (t)\/m 


( 37 ) 


In words, which are not very useful in this case, when we take linear combinations of the solutions, 
we must also take the same linear combinations of the driving forces, and vice versa. 

In particular, this means that can we always add a solution of the homogeneous equation, with 
no external force. 


jp + 2 r s + "») Ii(f) = Fl(f)/ro 

Mt) = 0 


(38) 


[axiit) + bx 0 (t)\ = aFi(t)/m 

We can use this form of linearity to simplify the problem. 

Harmonic driving forces 

Life is much simpler if we look at forces of the form 3 

F(t) = F 0 (39) 

where c Od is called the “driving frequency.” Why is this an interesting thing to do? This force is 
exponential — and therefore behaves very simply under time translations. 

F{t + a) = e~ iUda F{t) (40) 

Thus we can look for solutions that are proportional to e~ luJdt . There will also be terms in the 
general solution which are just like those we found for the undriven oscillator. We can always add 
these solutions because they do not contribute to the driving term. Thus the general solution will 
be of the form 

z(t) = Ae~ iu}dt + b + e~ v+t + 6_e" r ~ t (41) 

where the 

e~ r±t (42) 

3 We can actually use linearity to write any force as a linear combination of forces of this form using the mathemat¬ 
ical technique of Fourier analysis. So if we solve the problem for all values of ujd, we have actually solved it for all 
reasonable forces. 


' d 2 d 2 

dp +T Jt +u, \ 
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with 


r± = 



(43) 


are exponential solutions to the unforced oscillator problem. Just as in the case without damping, 
the coefficients of the “homogeneous” solutions must be set by initial conditions, but that is not 
true for A. It does not depend on initial conditions! This is what is called a “particular” solution 
to the differential equation, and the general solution will also involve solutions to the equation 
without any force, and the coefficients of the solutions will have to be determined by the initial 
conditions. 

One nice thing about the form (41) is that in certain cases, we do not care about the initial 
conditions. This is because the homogeneous solutions die out exponentially so long as there is 
any damping at all. If we wait long enough, only the term proportional to A survives. 

Now let’s compute A. This is straightforward because we are working with exponentials. 


' d 2 ^ d 2 n 

dv +r it + ‘ J \ 


Ae~ iuJdt = F, 


o e 


-iudt 


jm 


-Ud 

A = 
A = 
A = 


iTujd + cJq) A — — 

m 


F 0 /m 


uZ - iYuj d - u 2 . 


M 

F 0 /m 


uZ -lo 2 , - iYuja 


J d 

F 0 /m 


OJn — (jJa — iTuJ, 


A = 


Fo/m 


A = 


d ~ ^ ^d 

\ ( ~ Ud + iFud 

d ~ <'■*■ ) V^o — u d + 

(ot'o - u d + iVud) F 0 /rn 


, cun — uj 2 , — iTu, 


(44) 

(45) 

(46) 

(47) 

(48) 


K 2 - ^If + r 2 ^j 

Because we used the exponential solution, we got the solution just using algebra. It’s oc F 0 . 


(49) 


A = 


{ul - uj + iTu d ) Fo/m 


K 2 -^) 2 + r 2 ^ 

^0 - Vd) F o/' m 


= A + iB 


A = 


B = 


H ~ ujf + T 2 u 2 

Yu d Fo/m 


H - uj ) 2 + r 2 u 2 d 

Now what do we do with this complex solution. We are not really interested in the 
force we started with. However, we are interested in forces of the form 


(50) 

(51) 

(52) 
complex 


F 0 cos co d t = Re(F 0 e luJdt ) 


(53) 
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This describes a real harmonic driving force. Now the point is that because of linearity, we can 
find the solution for the real part of the force by just taking the real part of our complex solution. 
Taking the real part gives 


x(t) = Re [Ae = A cos LU d t + B sin uj d t (54) 

Note the phase relations. The first term is in phase with the force if cUq > cthat is when the 
system is driven slowly. In this limit, inertia is irrelevant, and the mass just moves along with the 
driving force. 

If cug < orj, when the system is driven rapidly, the first term is 180° out of phase with the force. 
In this limit, inertia dominates. 

In between, for Uq = cthe second term is crucial. It is 90° out of phase (behind) the driving 
force. This is illustrated in Mathematica file lecture-4-l.nb. 

A and B are called the elastic and absorptive amplitudes for reasons that we will discuss shortly. 

Here are graphs of A and B for T/u) d = 0.3, 0.1 and 0.05 For larger values of r/cj d , the 
resonance hardly shows up at all. 


Each of these starts at 



F 0 

muiQ 


for small damping, and looks like 

Fp/m 

u'o-uj 


except near resonance. 

Now, look at B for the same three values of T 


(55) 

(56) 
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Notice that as the T/cuo decreases, the resonance gets sharper and the effect of the B term is 
more and more concentrated near the resonance. 

Energy in the driven oscillator 

To see why A and B in (51) and (52) are called the elastic and absorptive amplitudes, consider 
the work done by the driving force. The power, P(t), which is the work per unit time done by the 
driving force, is the force times the velocity of the system on which it acts — 

d 

P(t ) = F(t) x(t) = F 0 cos u d t ■ — (A cos Udt + B sincu^f) (57) 

Note that the power is a nonlinear function. For example, if F 0 doubles, both Fit) and x(t) double, 
so the power quadruples. Because of this nonlinearity, we cannot use the complex form for x(t) 
because we could get contributions to the power from both the real and imaginary part, which 
is not what we want physically. We must use real form for x(t), which is what we have done. 
Continuing, 



= P(t) = F 0 cos Udt ■ (—aid A sin Udt + (Vd B cos LVdt) 

(58) 


= —F 0 UdA cos Udt sin u>dt + F 0 UdB cos 2 to d t 

(59) 

The first term in (59) averages to zero over a complete half-period of oscillation because 



1 

cos Udt sin ujdt = ~ sin 2c Odt 

(60) 

and 

fto+ir/LOd 1 , . , 

J sin 2u> d t dt — ^- cos 2u d t 7T ' UJd = 0 

(61) 


This is why A is called the elastic amplitude. If A dominates, energy that goes in comes back 
out, like an elastic. 

The second term is always positive — it averages to 


-^average ^ Po^dB (62) 

B is called the absorptive amplitude because it measures how fast energy is absorbed by the 
system. P ave rage, is maximum on resonance, at cu 0 = Xd, Furthermore, if the damping is small, the 
peak is very very sharp. This is one good way to find resonances. 

Breaking the wine glass 

The concept of energy is quite useful even in situations where the energy is not obviously con¬ 
served. Frictional forces, for example, eat up the energy. In fact, as you probably all know, the 
energy doesn’t go away, but it is converted into heat, and this process cannot be easily reversed, so 
that we cannot get the energy back into kinetic or potential energy. Thus for example in a damped 
driven harmonic oscillator started from rest, at first, the work done by the external force goes into 
increasing the amplitude of the oscillation, and the energy sloshes back and forth between kinetic 
energy and potential energy. But as the amplitude and therefore the velocity increases, more and 
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more of the power is eaten up by friction. When the system reaches its steady state and the free 
oscillations have died away, all of the work done by the external force is eaten up by friction (at 
least when averaged over a complete cycle of oscillation). 

I will end with a demonstrations that illustrates quite dramatically the process of resonance in 
damped, driven oscillation. We will do this in a system with many degrees of freedom — a crystal 
wine glass. The wine glass is a very complicated system, but its physics is translation invariant 
and for small oscillations, like almost any other oscillator, it is approximately linear. Thus it has 
normal modes — motions in which all the parts oscillate with a single frequency. Each of the 
normal mode frequencies corresponds to a particular way in which the wine glass can ring. The 
lowest frequency is what dominates if we just ring it. 

So what we are going to do is to feed energy into the wine glass at the frequency of its lowest 
ringing mode. For a good wine glass, the damping is really pretty small — it rings for a long time 
— so we can feed in a lot of energy and get the lowest mode very excited. So we will actually 
see how the wine glass is deforming as it rings. In fact, of course, we wouldn’t be able to do this 
without some help, because the frequency is very high, and the motion of the glass would just 
look like a blur. But what we can do is to use a strobe light tied to the driving frequency so that 
we illuminate the glass at the same point in the cycle each time, or at least very nearly so. The 
astonishing thing to me about this demonstration is how much the glass actually deforms before it 
breaks. If you have never seen this before, I think that you will also be surprised — so here goes. 

break the wine glass 
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lecture 4 

Topics: 

Where are we? 

Work and Energy in three dimensions 
Examples of potentials in 3-dimensions 
1/r 2 forces and field lines 
The force between spheres 
A particle on a frictionless track 


Where are we? 


Work and Energy in three dimensions 

One of the most useful and interesting applications of the concept of energy is to the motion of a 
single particle in three dimensions. In this case, the configuration of the system is described by the 
position vector of particle, r, and the velocity, the acceleration and the momentum are also vectors, 

v — f, a — f and p — mv. 

In a sense, we have already dealt with this because we have talked about system with more than 
one degree of freedom, and the particle in three dimensions is just a particular example of a system 
with three degrees of freedom. But it is a very important one, and the fact that the coordinates 
form a vector in three dimensional space has important consequences for both the physics and the 
notation. 

In three dimensions, Newton’s second law is a vector equation, 

ma = mv — mr — p — F (1) 


We will have much more to say later about what it means to say that something like (1) is a 
vector equation. For now, we will simply say that a 3-dimensional vector is an object with three 
independent components, and we will denote the component of r by (x, y , z) as you probably did 
in high school, and the components of an arbitrary vector A by (A x , A y , A z ). A crucial concept 
we will use often is the “dot product” of two vectors: 

A ■ B = A X B X + A y By + A Z B Z (2) 


The dot product is important because it has the same value even if we change the components of 
the individual vectors by rotating to another coordinate system. Here are a few of its most useful 
properties: 


The length of a vector A is | A j = y A ■ A 
A ■ B — \A | \B | cos Oab where 9ab is the angle between the vectors 


\A 


A ■ B = | B | cos 0\n is the component of B in the direction of A 


(3) 

(4) 

(5) 


Now let’s go back to (1). 
product of both sides with v, 


To get the time dependence of the kinetic energy, we take the dot 


mv 


d 

dt 


mv—mv-a — v-F = P 


( 6 ) 
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This is just the work-energy relation again. The only differences between this and the general 
situation for many degrees of freedowm is that all three masses are the same, and because of that, 
the sums can be replaced by dot products. This means that in the power, only the component of 
the force in the direction of the velocity contributes. 

Again, as with one degree of freedom, the time rate of change of the kinetic energy is equal to 
the power, P = v ■ F, fed into the system by the force. There is a conserved energy if the power is 
a total time derivative of a potential: 

v-F = -^U (7) 

at 

But now because the coordinates are the components of a vector, the condition that the forces are 
the partial derivatives of a single function U can be written as 

F(r) = U(r) = - V U(r) (8) 

or 


where 



is the “gradient” or “grad” of “div, grad and curl,” a vector with components 


(9) 


( d_ d d\ 
\ dx ’ dy ’ dz J 


( 10 ) 


We won’t make a lot of use of this. You will do more in 15b. But in this case, there is a nice 
geometrical way of understanding why it is that a force of this kind allows you to define a conserved 
energy. The work done by the force in going from one point in space to another is independent of 
the path — it just depends on the starting point and ending point. This is easy to prove, but we 
won’t bother — you will see it later. 

The second interesting thing about energy in three dimensions is that we can find velocity 
dependent forces that still lead to a conserved energy in a very natural way. As long as the force is 
perpendicular to v, v ■ F is zero, and therefore it is the time derivative of zero, and (7) is satisfied. 
One important way to do this to use the cross product (which we will discuss in more detail in a 
few weeks). A force of the form 

F = v x B (11) 

automatically satisfies v ■ F — 0. This is the way magnetism works. 


Examples of potentials in 3-dimensions 

It is easy to go from a potential to the force. Equation (8) says that you just differentiate. In fact, 
this step is so easy that we will often talk about finding the potential by running this backwards — 
that is finding a U(x) such that (8) gives the force we want. 

A familiar example of potential energy is the energy of a massive particle in the earth’s gravita¬ 
tional field. Near the surface, the field is roughly constant and vertical, and the force is downward 
with magnitude m g. The potential energy is therefore m g z, so that minus the z derivative is the 
z component of the force, — m g, and the other components vanish. 

U(r) — mgz (12) 
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( 13 ) 


F(r) = (0,0, -mg) 

For another example, consider the potential 

U (r ) = 1 K r 2 = ^ Ji (; x 2 + y 2 + z 2 ) (14) 

To get the a;, y and z components of the force, we just differentiate and change the sign: 

F(r) = ( -K x,-K y, -K z ) (15) 

This is a vector that points back towards the origin, so this describes a particle that has a stable 
equilibrium point at the origin. This is called the three dimensional harmonic oscillator potential. 

A more interesting question occurs when you are given a force and are asked to find the poten¬ 
tial. It turns out that this is possible if and only if the components of the force satisfy 1 


So for example, the force 
is associated with a potential because 

d_ 

dy 

6 )_ 
dz 

d_ 

dx 





Fir) = 


Wz F « 

dx z 

Y Fx 

dy 


0 

0 

0 


(yz, xz, xy) 


F z 

F x 

Fy 


dz Fy 

dx z 

— F 

r\ ± X 

dy 


= x — x = 0 
=y-y =0 
= z-z = 0 


(16) 


(17) 


(18) 


The “only if” part of (16) is easy to understand. If the components of F are minus the deriva¬ 
tives of a potential, (16) must be satisfied because the order of differentiations doesn’t matter. The 
“if” part is less obvious. It turns out that if (16) is satisfied, you can find the potential by integrat¬ 
ing the force, and it doesn’t matter what path you choose to integrate along. This is discussed in 
Morin’s book, and you will see much more of it in Physics 15b. That is enough for now. 

'As you will see later, this can be written more compactly using the cross product or curl as V x F(f) = 0. 
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1/r 2 forces and field lines 


Let us start our discussion of gravity by talking about what is so special about 1/r 2 forces? I will 
talk both about gravity and also the 1/r 2 Coulomb force between electrically charged particles. 
Gravity is really quite different from the Coulomb force. But the mathematics is pretty similar. The 
potential energy associated with the gravitational force between two massive particles in Newton’s 
theory of gravity is 




G mi m 2 
In - r 2 \ 


(19) 


where fj and r 2 are the positions of the particles and m± and m 2 are their masses. The force on 
particle 1 from particle 2 is 


.TVrom m 2 — 

on 


d 


df\ 


V (fj ,r 2 ) = — G mi m 2 


Ti - r 2 

In - r 2 1 3 * 


( 20 ) 


Because the potential energy is invariant under space translation because (19) just depends on the 
vector 

r \2 = n - f 2 (21) 

from f 2 to fj, the satisfies Newton’s third law: 

_ # ^ ^ \ _ * 

F from mj = \ (v 1 , V 2 ) — G TYll m 2 T~Zl „ = Tfrom m 2 (22) 

on m 2 or 2 |ri — r-z] 6 on rn x 


It is also no surprise that the force vector in (20) points from particle 1 to particle 2 because the 
potential just depends on the distance between the two particles and is therefore invariant under 
rotations in addition to space translations. So the force has to be proportional to fj 2 because there 
is no other vector in the problem. Then because the force is attractive, pulling particle 1 towards 
particle 2, the force is a positive factor times — fj 2 = r 2 1- 

The potential energy associated with the Coulomb force between two charged particles is (in 
cgs units) 


Qi Vi 
|n - r 2 


(23) 


where qi and q 2 are the charges. Clearly the dependence on the positions is exactly the same. 
Both potentials lead to a 1/r 2 force. So we can understand some of the special properties of 
the gravitational force by thinking about the Coulomb force instead. Before we start, perhaps 
I should also emphasize the differences. The most obvious difference is that while the electric 
charge can have either sign, the mass is always positive, so the gravitational force from (23) is 
always attractive. Less obvious, but even more important, is the effect of the fact that the strength 
of the gravitational force between two masses is proportional to the masses themselves. 2 This has 
far-reaching effects, a few of which we will see later. 

Now for the similarities and special properties. Newtonian Gravity and the Coulomb force 
share the crucial property of linearity. 3 The force on a particle due to an arbitrary number of 


2 Note that we are talking here about Newtonian gravity rather than Einstein’s General Relativity. Both of these 
statements must be made more carefully in General Relativity, as we will glimpse after break. 

3 Again this is true of Newtonian gravity but not General Relativity. In General Relativity, nonlinearities are present 

because the gravitational field carries energy, which in turn produces gravity. We will come back to this later. 
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masses or charges is just the sum over the forces from each of the other masses or charges. For 
gravity 


on m i 


— ^ ^ .Z^from rrij — (~T Tfl^ 

m onmi 




r i — rj 


\n 


(24) 


We can extend this in an obvious way to continuous distributions of masses or charges. The 
sums simply become integrals. In fact, as far as we can tell today, all the masses and charges in 
the universe are essentially point masses and charges. So our integrals really are approximations 
to sums. But the individual masses and charges are so small that the so-called continuum ap¬ 
proximation, in which we replace a collection of lots of tiny masses or charges with a continuous 
distribution is often an essentially perfect approximation. 

One thing I want to do today is to go over Newton’s theorem. It is sufficiently important that 
it doesn’t hurt to look at it several times. In fact, you will see it again in a different and more 
sophisticated form if you take Physics 15b next semester. The theorem is that the force on a point 
mass or charge due to a uniform shell of mass or charge is zero inside the shell and outside the 
shell is just what you would get if all the mass or charge were concentrated at the center. I am not 
so much interested in a rigorous proof as in making it obvious why the theorem is true. 

The key idea that we will use to understand the theorem is something we discussed when we 
talked about the electromagnetic force. The idea there was “field lines.” The electromagnetic force 
can be thought of in the following way. One charge produces an electromagnetic field that affects 
other charges, producing the forces (of course, all charges produce these fields, and we don’t talk 
about the effect of the part of the field due to a given charge on the charge that produces it — 
we will come back to this later in the course). The electric field, in turn, can be thought of as 
associated with field lines, which begin on positive charges and end on negative charges — they 
never begin or end in empty space — only on charges. The field lines point in the direction of the 
electric field, and the density of the field lines is proportional to the strength of the electric field. A 
single charge sends out field lines symmetrically in all directions. Since the lines can never end if 
there are no other charged particles around, they continue to spread. On a surface of radius r, the 
lines are spread out over an area rr 2 . Thus the density of the field lines falls off as 1/r 2 . Thus 
the electric field and therefore the force falls off as 1/r 2 . This is the basic reason for the Coulomb 
force. 
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For gravitation, field lines are very similar. Since there is only one kind of charge (which is just 
mass) and like charges attract, we can think of field lines as coming in from infinity and ending on 
masses. Like electric field lines, gravitational field lines cannot end where there is no mass. So all 
the rest of the argument goes through in the same way. 



Now what does this mean for Newton’s theorem? 

First let’s show directly that there is no force inside a shell on a point mass at P inside the 
uniform sphere of mass and look at the the force from P on some infinitesimal area A anywhere 
on the surface of the sphere. The point is that we can construct an infinitesimal area A' on the other 
side of the sphere by drawing a straight line from every point on the boundary of A through the 
point P. The locus of points where these lines intersect the sphere on the other side is the boundary 
of A'. 



Now we can show that the force on A' is equal and opposite to the force on A. 

We can see this geometrically, using the fact that the force goes like 1/r 2 . This is done in 
problem 5.10 of Morin’s book. But field lines make it even more obvious. First look at the force 
on little areas bounded by the cone at the positions of A and A' but perpendicular to the line 


6 



between A and A'. 


( 26 ) 


A' 



A 


The number of lines through each area is the same, and since the strength of the field is proportional 
to the density of the lines and the mass of the little regions is proportional to their area, the number 
of lines is proportional to the force. 

Now the areas of A and A' are larger than this because they are slanted. But both are slanted 
at exactly the same angle because the triangle formed by the center of sphere and A and A' is 
isosceles, and the tangents to the sphere are perpendicular to the radii to A and A!. 



(27) 


Thus both areas are larger than the area of the corresponding perpendicular regions by the same 
factor of 1/ cos 0. The the forces are large by the same factor and they are therefore equal and 
opposite. 

But by Newton’s third law, the forces on P from the two regions A and A' are equal and 
opposite. 

This shows that for every infinitesimal area on the spherical shell, there is a region on the other 
side that produces an equal and opposite force at P. This is not quite enough, because we need to 
be sure that when we integrate over the whole sphere, each of these little areas appears only once. 
Fortunately, this is easy. We can consider a plane through P and the center of the circle. Then we 
can find the total force at P by integrating only over the areas on one side of the plane. The areas 
that cancel the force are all on the other side of the plane. Thus the whole sphere is accounted for, 
and we have proven the first part of the theorem. 

Consider now the field outside a uniform shell of mass. This mass distribution has spherical 
symmetry. Let us first show that this implies that the force must be radial and equal in all direc¬ 
tions. First consider the force at some point r. This force must be the same if we rotate the mass 
distribution around the f axis, which implies that it must be in the f direction because any trans¬ 
verse component would rotate when we rotate the mass distribution. Once we know that the force 
is radial, it is clear, again because of rotation invariance, that it is the same in all directions, with 
the magnitude being only a function of \ f\. 
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Since the force vanishes inside the shell, there is no gravitational field, and thus no field lines 
inside. All the field lines go from infinity to the shell. But the total mass produces the same number 
of field lines, whatever the radius of the shell. This must be true because very far away from the 
shell, it looks just l ik e a point mass, so the number of gravitational field lines coming in from 
infinity must be the same independent of the radius of the shell. But the field line cannot appear or 
disappear outside the shell, because there is no mass there. Thus outside the shell, the gravitational 
field is completely independent of the radius of the shell, and therefore the same as that produced 
by a point mass at the center. This finishes the theorem. 

There are many important consequences of Newton’s theorem. One that we will come back 
to soon is the gravitational force on a point mass m inside a spherically symmetric distribution of 
masses centered at the origin. If the point mass is at r from the center of the distribution, Newton’s 
theorem implies that the force is 


—f 


GmM 


(28) 


where M is the mass inside r. This will be very important for one of the more mysterious facts 
about the universe — the existence of dark matter. 

You can think about these arguments involving field lines in two different ways. What is more 
fundamental - the field lines or the 1/r 2 force? The straightforward way to interpret this is to say 
that we know that the force goes like 1/r 2 and that allows us to show that the field line picture 
makes sense. Alternatively, we might say that the field lines are the fundamental things, and the 
field line picture implies (because the field lines spread out over the surface of a sphere which 
grows like r 2 ) that the force falls off like 1/r 2 . In fact, what we believe from a more fundamental 
description of these forces is that the field lines give the more fundamental description. This 
doesn’t matter at all if we stick to three dimensions. But it will be important when we talk later 
about theories of extra dimensions. In n dimensions, because the area of a sphere of radius r 
increases like r n_1 , the electric and gravitational forces fall like l/r n_1 ! 
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The force between spheres 

Newton’s theorem implies the gravitational force between two spherically symmetric bodies is the 
same as if all the mass were concentrated at their centers. This follows from linearity and Newton’s 
third law. Consider two spherical bodies, A and B. The force on each of the point masses in B 
due to A is the same as the force on due to a point at the center of A with the same total mass. But 
by Newton’s third law, this means that the force on A from each of the point masses in B is the 
same as the force on a point mass of the center of A. But we know that this is the same as a force 
between a point mass at the center of B and a point mass at the center of A. 

This means, among other important things, that spherical bodies exert no torques on one an¬ 
other. 

Before we leave (for now) the subject of field lines, I want to show you a couple of plots. 
One set of field lines that you are probably familiar with is the “dipole” field between positive and 
negative charges. It looks something like this in terms of field lines: 



Gravity is different. Because the charges are always the same, the field lines around a pair of 
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equal masses look this (same a electric field lines around a pair of equal negative charges): 



The point in the middle is characteristic of a gravitational tidal force. Right in the middle, there is 
no force. But above or below this point, you are pulled towards the middle, while on either side of 
the middle, you are pushed away from it. We will come back to this in a couple of months. 

A particle on a frictionless track 

A system with one degree of freedom need not be one dimensional. You have seen examples of 
this in the first problem set. Another nice one is a particle on a frictionless track in the earth’s 
gravitational field. Suppose a particle of mass m is constrained to move along a Motionless track 
that curves around in 3-dimensional space. Because the particle is constrained to be on the track, 
the configuration of the system is completely specified by the position on the track. It is convenient 
to measure this in terms of the actual distance of the particle along the track from some reference 
point on the track, which we will call i{t). In terms of i{t), we can specify the position of the 
particle in 3-dimensional space if we have mapped out the x, y and z components of each point on 
the track as a function of i. In other words, the vector r(£) describes the shape of the track, and the 
single variable £ describes the position of the particle on the track. 
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We haven’t learned any very simple way to directly write down the equations of motion for 
such a particle (we will next week). But we can write down the equation for energy, which should 
be the sum of the kinetic energy and the potential energy due to the earth’s gravity. 

The speed of the particle is just £, so the kinetic energy should be 

(29) 

and the potential energy comes from the earth’s gravitational field, 

U — mg z(£) (30) 

so that the energy is 

^ m (£) 2 + mg z(£) (31) 

If you think about how difficult it would be to write down the equation of motion and solve it using 
forces for a complicated shaped track, you will be begin to appreciate the power of conservation 
of energy. 
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lecture 6 

Topics: 

Where are we now? 

Newton’s second law and momentum 
The third law 
Rocket motion 

The Lagrangian and Euler-Lagrange equations 
A better horizontal swing 


Where are we? 

Last week, we discussed conservation of energy. I will talk about another of the great conservation 
laws of classical mechanics — conservation of momentum. We will see how it emerges from 
F = ma and discuss some of its uses, discuss the notion of scattering. 


Newton’s second law and momentum 

Newton’s second law for a single particle of mass m can be written as 

i? = rfp 
dt 


( 1 ) 


where the quantity p is the momentum of the particle, and is given in Newtonian mechanics by 

p = rrw . (2) 


The form (1) actually turns out to be a better and more general way of writing the second law than 
the familiar F = ma. In this form, (1) [but not (2)] is true even when the speed of the particle 
approaches the speed of light, where as we will see in a few weeks, many of the common-sense 
aspects of mechanics begin to break down. In addition, as we will see, (1) often allows us to deal 
more easily with situations in which objects come apart, or coalesce. 


The third law 

So if force is always changing momentum according to (1), how is it that momentum is conserved? 
The answer that you probably learned in high-school is Newton’s third law. For every action, there 
is an equal and opposite reaction. If thing 1 produces a force on thing 2, then thing 2 produces 
a force with equal magnitude and in the opposite direction on thing 1. If this is correct, then any 
change of the momentum of something is always compensated by a change in the momentum of 
the things that are producing the forces on it. The total momentum of any isolated system that has 
no external forces acting on it is always conserved. For now, you should just accept this. Later in 
the course (starting next week) we will talk more about why it is true. Here, we will be content to 
see how it is useful. 
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Rocket motion 


The most important uses of conservation of momentum all have a couple of things in common. The 
first is that using momentum conservation allows us to avoid thinking about incredibly complicated 
details of how the forces work that turn out to be irrelevant in the end. In fact, I personally often 
find (2) very confusing in situations where the system is changing. Trying to figure out the forces 
can be very confusing. 

It is usually easier and more reliable to make progress instead by comparing the system at two 
different times and using the fact that the momentum is the same. These two times may be far 
apart or close together. Sometimes we are interested in the initial condition of a system and the 
final condition, and we don’t much care about what happens in between. But sometimes, we are 
interested in comparing times that are very close together to use conservation of momentum to 
analyze the dynamics of the system. In the latter case, we can almost always analyze the problem 
by looking at the difference between the system at time t and time t + dt. A good example of 
using momentum conservation to simplify the analysis of dynamics is rocket motion. The nozzle 
of a rocket engine is a very complicated system. There are lots of forces acting on it as the rocket 
fuel explodes in the nozzle and is forced out at high velocity. If you had to understand in detail the 
forces acting on the stuff that is ejected from a rocket engine to see how the rocket would move, it 
would be an impossible job. 

But the point is that you never have to mention force at all. We are not interested in the force. 
There is no good way to measure this force directly. So get rid of it. Conservation of momentum 
ensures that all you need to know is the velocity, u, of the ejected material, and the rate at which 
mass is being ejected, dm/dt. You can simply figure out the rate of change in velocity of the 
rocket, dv/dt, by requiring that momentum be conserved. 

First suppose that the rocket is at rest at time t — 0. Then at t — 0 the momentum of both 
the rocket, p f , and the momentum of the fuel, p f , are zero. An infinitesimal time dt later, the 
momentum of the ejected material is p f = —u dm and this must be compensated by the change in 
momentum of the rocket in the opposite direction, p r = m dv. 



Thus 


dv dm 
m — = —— u 
dt dt 


This is also true if the rocket is moving with velocity v. 


Pr = 0 

t = 0 
Pi = 0 

p r = m dv 
t = dt 
Pi = —dmu 


(3) 


v — u 


dm 
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The changes in momentum are the same, though the total momentum has changed. 

If you are really more comfortable talking about forces, you can use conservation of momentum 
to calculate the total force on the rocket, then you know that the rate of change of momentum of the 
ejected material is u dm/dt. Since this momentum is being produced, the rocket must be pushing 
on the stuff with a force 


F 


dp dm 

— = u —— 
dt dt 


(4) 


F = u dm/dt. And therefore, according to Newton’s third law, the stuff is pushing back on the 
rocket with a total resultant force of this magnitude. 

Notice again that the rate of change of the rocket’s momentum is independent of how the 
rocket is moving (remember that u is the velocity of the ejected fuel with respect to the rocket). 
This means that the force on the rocket looks the same in any inertial frame. This is an example 
of an important principle in Newtonian mechanics - Galilean Relativity. Galilean Relativity is 
the statement that Newton’s laws are valid in any inertial frame. Forces cannot change in going 
from one inertial frame to another, because the force is proportional to the acceleration, and the 
acceleration depends only on the rate of change of velocity. Any fixed velocity of the the inertial 
frame just cancels out when we compute the acceleration. 

Thus people in the rocket cannot tell how fast they are going without looking outside. Velocity 
is only defined in a particular inertial frame. But acceleration is something that the people in the 
rocket can feel because they can feel the force. The floor of the rocket is pushing against them to 
accelerate them along with the ship and they feel that force in their bones. But it is the same in any 
inertial frame. Thus the passengers can tell how they are accelerating, but they cannot tell how fast 
they are going without looking out the window (or integrating the acceleration from the beginning 
of the trip). 


The Lagrangian and the Euier-Lagrange equations 

In addition to discussing momentum conservation this week, we are going to begin our study 
of a remarkable reformulation of classical mechanics, based on the Lagrangian and the Euier- 
Lagrange equations. This will replace F = ma as the equation of motion of a classical system with 
something that is actually easier to use in most of the situations that really matter. This week, we 
are just going to write down the answer. Next week, we will talk bit about why it works, and why 
it is so interesting. Our reformulation is not just a great labor saving device, it is actually telling 
us something very deep and important about the way the world work. This will move us closer to 
understanding F = ma. But already this week, you should begin to see that the Lagrangian of a 
classical system and the Euier-Lagrange equations that we derive from it are better than F = ma 
in many ways. 

So without further ado, let me write down the answer for a single degree of freedom. We begin 
by constructing the Lagrangian, C. 

C(x, x, t ) = T(x, x,t) — U (. x , x, t ) (5) 

Where T and U are the kinetic and potential energies, as functions of the variable x that describes 
the configuration of the system, and its time derivative, x. This time I have included the possibility 
that these functions also depend explicitly on t. By explicit time dependence, I mean dependence 
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beyond that coming from the time dependence of x(t). We will come back to this a little later when 
we discuss energy. In terms of £, the equation of motion of the system is 


d_ 

dx 


£(x, x, t) 


d d 
dt dx 


£{x, x, t ) 


0 


( 6 ) 


Solving the differential equation (6) and imposing the initial conditions then gives us the classical 
trajectory x(t). 

I will illustrate this today with two examples. The first is a problem you already how to deal 
with - a mass m moving in force derived from a potential U(x). In this case the kinetic energy 
depends only on the velocity 

T(x) = ^ mx 2 (7) 

and so the Lagrangian depends only on x and x: 

£(x,x) = ^mx 2 — U(x) (8) 


So 


and the Euler-Lagrange equation is 


d_ 

dx 


£(x, x) = 


d 

dx 


£(x, x) 


~l u ^ 


= mx 


(9) 

( 10 ) 


or 


d dd d d 

dx C{x ' X) - Mdx C{x ' x) = ~Tx Uix) ~ dt mx = 


d 


d 

— —U(x) — mx = 0 
dx 


— —U(x) = mx 
dx 

or, because minus the derivative of the potential is the force, 


(ID 

( 12 ) 


F = ma 


(13) 


A better horizontal swing 


OK, so we showed that our funny looking Euler-Lagrange equation is equivalent to F — ma. So 
what? That and about ten bucks will get you dessert at Finales. But now let us do a more interesting 
problem related to one of the problems you just did. 


•— your arm 


lldTIUJ 

K/2 

HH 

you 


K/2 

QSLQSISI 

cart 

QSLQSLQ. 


The system shown above is a cartoon of a different kind of horizontal swing set from the one you 
studied in AS2. A cart with mass /i slides without friction in the x direction, attached to springs as 
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shown. You with mass m sit on the cart. You and the cart are at rest in equilibrium until t — 0. For 
t > 0, by pushing and pulling on the handle with your arm (or arms if you prefer), you slide back 
and forth on the cart, so that your horizontal position with respect to the handle is 

£ cos u>dt (14) 

where 



for some constant t. Call the horizontal displacement of the cart from equilibrium x. Let’s find the 
equation of motion for this system. 

One of the differences between this system and the one you studied on the problem set is that 
there is only one degree of freedom here — x. Your position with respect to the cart is completely 
fixed by the statement of the problem, so once we have determined x(t), we know everything. 

Another difference is that the resonant frequency depends on the total mass n + m, rather than 
just the mass of the cart. 

Yet another difference is that we didn’t mention the force on you or the handle. We don’t need 
to know it to do the problem. Since we don’t know the force, we can’t use F = ma directly 
(though you with sufficient ingenuity, you can find a way to do it). The the Lagrangian technique 
works just fine. 

The potential energy of the system is due to the spring, and we can take it to be 

\kx 2 (16) 

The kinetic energy of the system gets two contributions, one from the motion of the cart and one 
from your motion. The kinetic energy of the cart is just 

\^x 2 (17) 

Your motion is a little more complicated. Up to a constant, your horizontal position for t > 0 is 

x + £ cos u>dt (18) 

and thus your velocity is 

x — Ilo ( i sin uj f jt (19) 

and your kinetic energy is 

-m(x — lud sinccrff) 2 (20) 

The total kinetic energy is then 

X 1 

-/tT 2 + -m(x — £u>d sin u> d t) 2 (21) 

and the Lagrangian is 

11 1 

C(x,x,t ) = -/rT 2 + 2 m (^ — ^rfsincurff) 2 — -iUr 2 (22) 
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Now we can write down the Euler-Lagrange equation. 


—C = fix + mix — iuod sincurft) 
ox 


Thus the Euler-Lagrange equation is 



-Kx 


(23) 

(24) 


d 

dt 


(fix + mx 


rn£u)d sin Udt) 


-Kx 


(25) 


or 


(fi + m)x = —Kx + miuj cosc Odt 


(26) 


So at the end of the day, we see that this is a forced oscillation problem like the problem on the 
problem set, except that it involve the sum of the masses. This means that indeed we chose the right 
driving frequency to get resonance in (15). We can now put in the same form for a trial particular 
solution that you explored in the problem set: 


x(t ) = At sin uj dt 


(27) 


which satisfies 

So 


x(t ) = (—Auj^tsmujdt + 2AujdCosuJdt ) 


(fi + m) (— Aoj 2 d t sin u>dt + 2 Ac^ cos Udt) = —K A t sin u>dt + mico^ cos c Odt 


(28) 

(29) 


and 


miiXd 
2 (fi + m) 


(30) 


This system is animated in the Mathematica notebook swing-horizontal2.nb bundled with 
your lecture notes. 
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lecture 7 

Topics: 

Where are we now? 

Scattering and kinematics 
Elastic collisions 
Inelastic collisions 
Generalized Force and Momentum 
Example - bead on a expanding ring 
Example - bead on a rotating rod 
More degrees of freedom 


Where are we now? 

Momentum conservation and Lagrangians are two fantastic labor-saving devices. Next time we 
will see how they are related. This time we will just do a lot of examples. 


Scattering and kinematics 


The idea of scattering is very important in many subfields of science (and pseudo-sciences like 
economics for that matter). The idea is that one does not always have to follow the trajectories 
of all the particles in a process in detail to learn something about the process. Often, important 
information can be obtained by just looking at the initial state and the final state, and not asking 
about the details of what happens in between. Here is an example. Suppose that we have two parti¬ 
cles in three dimensional space interacting through a potential that depends only on the difference 
between the position vectors of the two particle. The energy is then 


mi 4 2 m 2 4 2 

n + r 2 + v ( r i ~ r 2j 


( 1 ) 


We will discuss in more detail later in the course why such a system conserves energy and momen¬ 
tum. For now, just note that this potential energy leads to forces that are consistent with Newton’s 
third law. The force on particle 1 is 

A - f2) ) <2) 
The force on particle 2 is 

p2 = - r-y ) <3> 


The chain rule implies that F) = — F 2 , so that Newton’s third law is satisfied, and momentum is 
conserved. 

Let’s also assume that V is “short-range” which means that 


V (r) = 0 for large |r 


(4) 
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or at least that V (r) goes to zero very rapidly as |r| —> oo. Even if we do not know what V looks 
like in detail, we can say interesting things about this system because energy and momentum are 
conserved. Now suppose that we consider a process in which particle 2 is at rest at the origin and 
particle 1 approaches from far away. Initially, the potential is irrelevant because the particles are 
far apart. Particle 1 has velocity vu- After the interaction, the two particles will typically be far 
apart again, so again the potential will be irrelevant. The importance of the fact that the potential 
energy is irrelevant is that it means that the total kinetic energy long before the collision is the same 
as the total kinetic energy long after the collision. At some intermediate time, when the particles 
were close together, some kinetic energy was converted to potential, but it all comes back as the 
particles separate. Thus, as long as one only asks about the initial and final states, kinetic energy 
is conserved. And of course, because Newton’s third law is satisfied, momentum is conserved at 
all times, and in particular for the initial and final states. This is a scattering process. We say that 
the two particle have scattered from one another. 

Now the point is that kinetic energy and momentum conservation put very strong constraints 
on this process. Suppose that particle 1 has velocity v \/ after the interaction and particle 2 has 
velocity v 2 f- 

Since the initial momentum and energy of particle 2 are zero, we can write: 

mi vu = m, v x f + m 2 v 2f — v u = — v lf + — v 2f (5) 

These constraints are very powerful. If you don’t know the mass of particle 2, for example, you can 
calculate it from the three velocities. Scattering gives you information about the particles involved. 

There is an interesting linguistic distinction that is made in calculations like this. We talk 
about the features that follow from very general principles like conservation of kinetic energy and 
momentum as the “kinematics” of the process. This is to be distinguished from the “dynamics” 
of the process, which is everything else — in particular the details of the force law. At this point, 
the distinction probably seems a little arbitrary, but in the next couple of weeks, as we begin to see 
how general these conservations laws really are, this distinction will be more and more important. 

Another place where the idea of scattering is crucial is in my own field of particle physics. I 
study particles that are very small. We can detect them, we can see their tracks, measure their 
velocities, and energies and momenta, just as we would with a larger object. But they are so small, 
that we cannot follow what happens when two of them collide in detail. We simply cannot measure 
the forces involved in the tiny fraction of a second during which colliding subatomic particles are 
in “contact” with one another. 1 What we do is scattering experiments, in which we measure the 
initial energies and momenta of the particles before the collision, and then again after the collision. 
Here, conservation of energy and momentum are really useful, because they put very strong limits 
on what can happen. We will discuss this a bit now, and then in much more detail in a few weeks 
when we discuss energy and momentum in relativity. 

'in fact, the whole notion of these particles being in contact with one another is rather problematic. It is not clear 
what it means. 
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Elastic collisions 


A familiar example of the use of kinematics is in elastic collisions of rigid bodies, like billiard 
balls. The force in this case is certainly short-range, because it is only non-zero when the balls are 
actually touching. “Elastic” is just a code word meaning that kinetic energy is conserved. 

A famous and beautiful result that follows simply from conservation of kinetic energy and 
momentum in an elastic collision is that if a moving particle with velocity v collides elastically 
with a particle at rest with the same mass, the dot product of the velocities, v\ and v 2 , of the two 
particles in the final state vanish, uj • v 2 — 0. Thus either one of the two velocities vanishes, or else 
the two velocity vectors are perpendicular to one another. This is very neat result, and it is easy to 
prove using conservation of kinetic energy and momentum. Conservation of kinetic energy gives 


-771 U 2 = -777 V? H-777 Vn 

2 2 2 


( 6 ) 


Conservation of momentum gives 

777 V — 777 V \ + '777 V 2 


(7) 


Eliminating the ms and taking the dot product of each side of (7) with itself gives 


v 2 = {vi + v 2 ) ■ (vi + v 2 ) = Vi + v 2 + 2 Vi ■ v 2 


( 8 ) 


Comparing this with (6) gives the desired result. No forces were ever mentioned. In fact, it does 
not even matter here which of the two particles in the final state was initially at rest and which was 
moving, because they have the same mass. 

There is another way of thinking about this result that is rather neat. Suppose that we look at 
the process in the zero momentum frame. We can get to this frame by moving all of our measuring 
apparatus with velocity v/2. In this new frame, particle that was moving now has velocity 

VCM = v/2 (9) 


and the particle that was initially at rest has the opposite velocity, 

-VCM = -v/2 


( 10 ) 


so the sum of the momentum vectors is zero. After the collision, again the particles must be moving 
with equal and opposite velocities, and energy conservation requires that the speeds be the same 
as the initial speeds. That is all the particles can do is to change direction without changing speed. 

Vicm = ~v 2C m with Vcm — V\ C M (11) 

Now we can go back to the original frame by just adding back the vcm from the motion of the 
frame. Then you can see that 


V\ — Vi cm + vcm 

Then taking the dot product gives 


v 2 — v 2 cm + vcm — —v\cm + vcm 


v\ ■ v 2 — (v\cm + vcm ) • {—vicm + vcm ) — ~ v l C m + v 


1CM 


= 0 


( 12 ) 

(13) 
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Inelastic collisions 


One of the hard parts of doing physics is figuring out what principles to use in a particular prob¬ 
lem. At some level, for example, we believe that energy and momentum are always conserved. 
Sometimes, as in the example we just discussed, kinetic energy and momentum are obviously con¬ 
served. But for example, when I take an egg and drop in on the floor, it is certainly not obvious. 
Before it hits the floor, the egg has kinetic energy and momentum. After it hits the floor, it is just 
a scrambled mess, sitting still, with no kinetic energy or momentum. What happened? Clearly the 
problem with the eggs momentum is that the egg is not an isolated system. The floor pushed up on 
the egg’s shell (and broke it), to change the eggs momentum. The problem with the egg’s energy 
is that energy can get transformed from one form to another. In the process of breaking, the egg’s 
parts actually heated up slightly, so slightly that we don’t notice it, but enough so that energy is 
conserved. 

Frequently, we are interested in collisions in which momentum is conserved, but kinetic energy 
is not. These are called “inelastic” collisions. The classic inelastic collision is two lumps of clay 
hitting and sticking. If the two lumps have masses rn j and m 2 with velocities V\ and v 2 respectively, 
the final momentum is 

miVi + m 2 v 2 (14) 


We can now compute the final velocity of the system by dividing by the total mass, 

_ ni\V\ + m 2 v 2 

Ai rial | 

m i + rn 2 


(15) 


There are two keys to obtaining (15). Notice that we have not used conservation of energy. What 
we used instead is the physical picture of the event. The key word is “sticking.” This means 
that the two lumps are moving with the same velocity after the collision. This allows us to use 
conservation of momentum alone to find the final velocity. There is also an unspoken assumption 
behind (15)— the common sense statement that mass is conserved, like energy and momentum — 
that the mass of the system of two lumps stuck together is just the sum of the masses of the two 
lumps. This is very reasonable, and it is consistent with what each of knows in our bones about 
the world. However, it is wrong. We will see in a few weeks that when particles collides at speeds 
close to the speed of light, conservation of mass may be completely wrong. Nevertheless, it is a 
very good approximation as long as none of the particles are moving at close to the speed of light, 
and because the speed of light is so enormous compared to what are used to, this is usually not 
much of a restriction. 

Since we haven’t used conservation of energy, it is of some interest to calculate how much 
energy is lost in the collision. This is 


m i^ 2 m 2 _ 2 

T ” 1 + IT 1 " 2 


mi + m 2 (miVi + m 2 v 2 \ 2 n _ , 2 

m 1+ms j ~ 2 Vl ~ 


(16) 


where // is an important quantity called the “reduced mass.” We will discuss more of this later. 
Note that the energy loss never vanishes in this process unless the lumps were moving together in 
the first place. Some energy is always lost whenever there is “sticking.” 
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Generalized Force and Momentum 


For a particle of mass m moving in a potential V(x), the Euler-Lagrange equation of motion can 
be written as 


dp d 
— = —(mx 
dt dt y 


d dC dC . 

= dtd± = !h = ~ ^ 


— the rate of change of the momentum is equal to the force. In 
suggests that we might regard the Euler-Lagrange equations, 


= F(x) (17) 

the more general situation, this 


d dC _dC 
dt dqj dqj 


(18) 


as a generalization of this — we call 

dC 

dqj 

the “generalized momentum” corresponding to the coordinate qj and 

cK 

d qj 


(19) 


( 20 ) 


the “generalized force” corresponding to the coordinate qj. Then the Lagrange equation says that 
the rate of change of the generalized momentum equals the corresponding generalized force. 

A particularly interesting case occurs when the Lagrangian does not depend at all on some 
coordinate q r In that case, (18) implies that the generalized momentum corresponding to q : j is 
constant. This statement becomes even more interesting when you realize that we have great 
freedom to choose the coordinates any way we want to. Thus if there is any coordinate system in 
which the Lagrangian does not depend on some coordinate, then there is a conservation law — the 
corresponding generalized momentum is conserved. 


Example - bead on an expanding ring 

Let’s warm up by doing a couple of examples of Euler-Lagrange equations for systems with a 
single degree of freedom. I’ll begin with a system that is mathematically very simple, but that I 
don’t actually know how to build. Suppose we have a small bead with mass m that slides without 
friction on circular ring centered in the x — y plane, but whose radius grows as a function of time 
as 

r(t) — r 0 +v rr t (21) 

It will be most useful to analyze this in terms the polar angle 6 in terms of which the x and y 
coordinates of the bead are 

x = r(t) cos 6 y = r(t ) sin 6 (22) 

We can find the kinetic energies by differentiating (22) to get the components of the velocity, 

v x = v rr cos 6 — 9 sin 9 v y = v rr sin 9 + 9 cos 9 (23) 
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We can also write the kinetic energy of the bead directly in terms of 0 using the fact that the radial 
motion of the bead due to the expansion of the ring and tangential motion associated with changing 
0 are instantaneously perpendicular. Either way se see that v 2 can be written as 

V 2 = i 2 + ^2 = v 2 _ + r ( t ) 2 ^2 (24) 

Remember here that r(t) is not a dynamical variable — the time dependence of r is imposed on 
the system from the beginning. There is no potential energy because the system is in a horizontal 
plane, so the Lagrangian is 

£(0,M) = 7T ( v rr + r(f) 2 0 2 ) (25) 

The Euler-Lagrange equation here is particularly simple because the Lagrangian does not depend 
at all on 0. Thus 

0 = ^Lc(9,Q,t) — — c(o,9,t) = = — — (mr(t) 2 9 ) (26) 

06 V ’ ’ ) dtdd V J dt 86 ^ ' dt V w ) 

What is nice about (26) is that the solution (or more properly the “first integral”) is really simple. 

m r(t) 2 9 = a constant (27) 


The relation (27) is an example of conservation of a generalized momentum. The Lagrangian does 
not depend on 9, only on 9, so the generalized momentum 


dC 

~89 


(28) 


is constant. In fact, as we will discuss in more detail later, this is the angular momentum, which is 
conserved because there is no torque on the mass about the origin. 

To find the constant, we need an initial condition. For example if the angular velocity at t = 0 
is 9(0) = ujq, then we can write 

mr(t) 2 9 = mr^cd 0 (29) 


thus 

| = dO = r 2 G cu 0 
dt (■ r 0 + v rr t ) 2 

We can now integrate this to find 9(t). Again we need an initial condition. If 


(30) 


0(0) = 0 O 


(31) 


then 


0(f) = 0o + / dt' 
J o I 


rl cu 0 


— 0o + 


cu 0 r o t 


(32) 


(r 0 + V rr t') 2 ” ' r 0 + v rr t 

There is another way of thinking about this which is slightly amusing. If I am a one dimensional 
creature living on this expanding ring, the variable I care about is not the angle, but the actual 
tangential distance I have traveled along the ring, which is 


£(t) = / r(t')9(t') dt' 


(33) 
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so that differentiating to get the velocity gives just the tangential velocity 


l(t) = r(t)0(t) 


( 34 ) 


Differentiating again to get the acceleration gives 

K*) = J t ( r (*) 6(t )) = j t (r(f) 2 0(t)/r(t)) (35) 

But from (26), the product of the first two terms is a constant so 

*(t) = r(t) 2 6(t) j t (l/r{t)) = r {t)H{t)^^=r{t)6{t)^^ = - r -^£{t) (36) 


or 

Now the point is that this looks like F 
force, 




(37) 


= ma for a particle subject to a velocity dependent frictional 

A*) i 


-m 


r(t) 


(38) 


Creatures living on this expanding ring would feel this friction as a result of the expansion of their 
“space.” It is very different from the frictional forces that we have talked about so far because 
this is a conservative system. No energy is being lost to heat, but instead, something that looks 
like friction is generated by the time dependence of the system. Something very much like this 
is going on in our expanding universe - it is called “Hubble friction.” You see here that is it just 
conservation of a momentum (in this case angular) in the expanding space. 

This system is animated in the Mathematica file lecture-7-l.nb. 


Example - bead on a rotating rod 

Next, consider a bead with mass m on a straight frictionless rod that rotates with constant angular 
velocity oj. For simplicity, we will let it rotate around the origin in the x-y plane. Then gravity 
plays no role. The angle with the x axis at time t is c ot. This would not be a trivial problem if we 
were armed only with F = ma. But with a Lagrangian it is easy. For one thing, the Lagrangian 
technique allows us to focus just on where the bead is on the wire. Let £ (which can be negative) 
be the distance along the rod where the bead sits at time t. This specifies the configuration of the 
system. The position of the bead at time t is then 

x — £ cos cut y — l sin cut (39) 

The velocity is 

x = £ cos c ot — it o sin ut y = £ sin ut + £co cos ut (40) 

Thus the kinetic energy is 

^m(x 2 + y 2 ^j = ^m(i 2 + £ 2 l u 2 ) (41) 
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The explicit time dependence from (40) has gone away when we form the square of velocity. If 
you think about the two terms on the right hand side of (41), you will realize that the first one is 
related to the motion of the bead along the rod, and the second one with the rotation of the rod. 
Because these two motions are perpendicular, there is no cross term and the total kinetic energy is 
just a sum of the two effects. 

In this case, there is no potential, so the Lagrangian is just given by the kinetic energy, (41), 
and it doesn’t depend on t explicitly, so we don’t need t in the list of variables, 

£{£,£) = ^m(e 2 + £ 2 uj 2 ) (42) 

and the Euler-Lagrange equation is 

0 = mduj 2 — m£) (43) 

or 

e = lu 2 £ 

This is a differential equation we can solve easily because it is linear and TTI 
is 

e(t) = C e wt -\- D e~ ult 

You are likely to see this general solution written as 

£(t) — A cosh u>t + B sinh ut (46) 


(44) 

. The general solution 

(45) 


in term of the so-called hyberbolic functions, 


sinh x 



e x + e x 
cosh x = --- 


(47) 


These are handy, because they have properties that are reminiscent of the more familiar trigonometic 
functions, sin x and cos a;. But if you are not used to them, you can always do everything in terms 
of exponentials. At any rate, if the bead starts from rest at i = £ 0 at t = 0, the solution looks like 


e ^t _j_ 

£{t) = £q cosh cut = £q --- 


(48) 


Exponentials get big quickly, so this bead gets going pretty fast. For example, in the time 7r/cu, 
while the rod rotates through 180°, the distance from the origin increases by a factor of 


cosh 7r 11.6 


(49) 


So this is at good way to launch things. 

If instead, the rod rotates with constant angular velocity oj around the origin in the x-z plane in 
the earth’s gravitational field, then we have to include the effect of gravity - but this is easy. Again, 
we take the angle with the x axis at time t is cut. 
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The configuration of the system looks the same except that z replaces y. Thus the kinetic 
energy looks the same. But now there is a gravitational contribution to the potential energy 


U(£,t) = mgz = rrigi sin ut 

Thus the Lagrangian in this case depends explicitly on t. It looks l ik e 


£(£,£, t) = -m(£ 2 + £ 2 u 2 ^ — mg£ sin cut 


and the Euler-Lagrange equation is 


d 


0 = m£u 2 — mg smut —— (m 


dt 


or 


£ = u 2 i — g sin cut 


(50) 

(51) 

(52) 

(53) 


More degrees of freedom 

The Lagrangian works just as well for more particles, or in more dimensions. Suppose that there 
are n particles, so that 


t =y. 

.-I 


m* .o 


and U — V (aq, 


i X r 


(54) 


Then it is easy to see by the same sort of arguments that 

SS 


where 


7 — = —rrij Xj 

5 Xj 


-Vj(x i, ■■■ ,x n ) = 


Vj(x i, 


, x , 


d 


dx. 


V(xi, ■■■ ,x n ) 


(55) 


(56) 


is the force on particle j. For S[x] to be an extremum, we must have (55) vanish for each j, which 
just gives F = ma for each particle. 

If the Lagrangian depends on more degrees of freedom, there is an Euler-Lagrange equation 
that must be satisfied for each coordinate: 


d / \ d d / \ 

C {x,x,t) --—£[x,x,t) =° 


dx 


(57) 


where the xs and xs in the function C now indicate all the components — that is jC(x, x. t ) is really 
a shorthand for 

£{x 1 ,x 2 , ■ ■ ■ ,xi,x 2 , ■ ■ ■ ,t) (58) 


9 



lecture 8 

Topics: 

Why Euler-Lagrange equations? 

Hamilton’s principle 

Functions of functions 

Calculus of variations 

Functional derivatives 

Finding functional derivatives 

Back to Hamilton’s principle 

The Fagrangian and the action 

Quantum mechanics and the classical trajectory 

Appendix: On the functional Taylor series 

Why Euler-Lagrange equations? 

In the last week, you have done the really important part of your study of Fagrangians, at least 
for this course. You have have learned how to construct them and to derive from them the Euler- 
Fagrange equations that replace F = ma as our equations of motion. You have seen in a non-trivial 
example (the second problem on as3) that the physical results are independent of the particular 
choice of coordinates you use to write down the Fagrangian. Next time, we will see how this 
gives a beautiful explanation of the great conservation laws of energy and momentum in terms of 
fundamental symmetries of the world. I hope that all of this has convinced you that the Fagrangian 
formulation of mechanics really simplifies your fife. We won’t come close in this course to seeing 
all the advantages of this beautiful way of doing mechanics (you can get a little sense of it by 
reading though the sections of Morin that I have not assigned). You can see more in Physics 151 if 
your appetite is whetted. 

Nevertheless, as I expected, and indeed as I saw on the QA this week, most of you would kind 
of like to understand why it works! This is not really fair of course. You don’t know why F = ma 
works either. You are just so used to it that it seems reasonable. But there is something that we can 
say about that, while it is hard, will help you understand some of the magic of Fagrangian. And, as 
I will try to convince you, it is telling something about the way the world really works at a deeper 
level. 

To do this, I have to introduce some mathematics that will probably be hard for most you - 
essentially the calculus of an infinite number of variables! This probably sounds really scary, and 
it is scary. But DON’T PANIC! Remeber that you have already done the really important part. 
This next week is mostly for your general education. But you will find, I hope, that this deeper 
formulation not only increases your understanding, but will give you a useful bag of tricks for 
dealing with practical problems. 

Here is the deal. The Euler-Fagrange equations can be derived from something called “Hamil¬ 
ton’s principle” which is the statement (which we will make more carefully in a moment) that the 
classical trajectory is the path of the classical system through time for which small changes in the 
path do not change a quantity called the “Action” which is obtained by integrating the Fagragian 
over the path. Most of todays class will be taken you by trying to understand what this statement 
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means and showing that it give rise to the Euler-Lagrange equations. At the end, I will try to give 
you at least a hand-waving idea of how this is related to quantum mechanics. 


Hamilton’s principle 


We will start with the very simplest example - a mass moving in a potential in one dimension. 
Consider the one dimensional motion of a particle with coordinate a: in a potential V ( x ). Call the 
particle’s ki netic energy 

T(x) = | x 2 (1) 

Now we would like to find a particular trajectory x(t) are such that the particle moves from X\ at 
time t\ to x 2 at time t 2 — or 

x(ti) = Xi x(t 2 ) = x 2 ( 2 ) 

Hamilton’s principle is the statement that if you compute the quantity S[x] (called the action — 
I’m not really sure why) - the following integral of the Lagrangian: 



C(x(t) 


dt 



V(x{t) 


dt 


( 3 ) 


depending on a function x(t) satisfying 1 then the variation of S [x] with respect to the function 
x(t) vanishes for the actual trajectory. The hard part of Hamilton’s principle will not be proving 
this statement, but figuring out exactly what it means to talk about the variation of something with 
respect to a function. 

The first thing to notice is that the Lagrangian C = T — V is an ordinary function of two 
variables, x and x, 2 but that S[x] is actually a function of a function — it depends on x(t) for all 
values of t from t\ to t 2 where x(t) can be any function satisfying (2) — something like 



1 We will see later that the restriction to functions satisfying (2) is important and has to do with the initial conditions 
that we have spent so much time talking about in the first couple of weeks. 

2 In the simple example we are working out now, T is a function of one of them and V of the other, but in general, 
both T and V could depend on both x and x. 
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Thus S'[a;] is a function whose argument is itself a function (such an object is sometimes called a 
functional and I may sometimes use that term). I have put the x in square brackets to remind you 
that in this case a: is a function rather than a number. The value of S[x], on the other hand, is just 
a number. It doesn’t depend on t. The variable t in (3) is a dummy variable. Now Hamilton’s 
principle is a statement about the variation of S[x\ as we let x(t) vary over all possible functions 
satisfying (2). 3 We will find that when x(t) is very near to a solution to Newton’s second law, then 
[a;] changes slowly as a function of whatever parameters you use to specify the function x(t). 
When x(t) is a solution, S[x] is not varying at all. 

Functions of functions 

I imagine that the idea of a functional — that is a function of a function — is pretty unfamiliar to 
most of you, but you shouldn’t get too worried about it. Most of you have learned or are beginning 
to learn about the calculus of functions of several variables, and dealing with functionals is not 
really much different. The main differences are in notation. In fact, I would argue that the big jump 
is going from one or two variables to more than that. Once you have made it to three variables, it 
really doesn’t get any harder to think about more — not even an infinite number more. 

Here is what I mean by the peculiar statement that going from two variables to three variables 
is what is hard. This has to do with visualization. As you have probably noticed, it gets harder to 
make a mental picture of what a function means as the number of variables increase. A function 
of a single variable is easy. We naturally associate a function f(x) with a graph of y = f(x). 
This allows us to do things that bring the function alive to us, such as relating the derivative of the 
function to the slope of the line in the graph. We can sort of do something similar with a function 
of two variables f(x, y ) by imagining a surface in three dimensional space with z = f{x, y ). This 
can be quite helpful, because it allows you to have a visual representation of features that only 
appear with more than one variable, such as the gradiant vector, which for two variables x and y 
looks like the two dimensional vector 


(d_ d\ 
\ dx ’ dy J 


( 4 ) 


The vector Vf(x,y) points in a direction that corresponds to going upwards on the surface z = 

f(x,y). 

But what do you do about visualizing functions of three variables, or more? There is really no 
ideal way of doing this. So already by the time you get to functions of three variables, you have to 
stop relying on visual crutches and just develop an analytic sense of what the function means. 

A functional like S[x] is just a function of an infinite number of variables where the variables 
are the values of x(t) at all possible values of t. The really peculiar new thing about a functional is 
that the variables are labeled by a continuous parameter, rather than having different names like x, 
y and z, or different discrete indices, like ai, 02 , etc. This is why functionals look so different and 
why we have to invent some new notation to deal with them. 

3 Really? All possible functions? One has to be reasonable here — it might be better to say all possible functions 
for which the integral in (3) makes sense — differentiable functions in this example. We will ignore such issues and 
leave them for the mathematicians. 
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A mathematical example of a functional is the length of a path described by a curve, y — f(x). 
The path length from X\ to x 2 is 


rx 2 /- rx 2 

P[/]=/ px 2 + dy 2 = 

Jx 1 -'Xl 



(5) 


The path length depends on the function / that defines the shape of the curve. This example is 
particularly interesting because it depends on the endpoints in same way that (3) does. 

A specific example of path length may be useful. Consider a graph y — f(x) for the function 


f(x) = V R 2 — x 2 
f'(x) = —xj\jR 2 — x 2 

Then the path length from x\ to x 2 is 


f Cl + f'(x ) 2 dx = f R/VR 2 — x 2 dx 

Jx i Jx 1 


which with the substitution x = R sin 9 becomes 

R 


/•arcsin x 2 /R / . X 2 

' dt) — R 1 arcsm — 

arcsin /R V JX 


arcsm 


Xl 

R 


( 6 ) 

(7) 

( 8 ) 

(9) 


And this is right because this is just the arc length along a circle, as show in the diagram below. 



Here is a more physical example. Suppose that the function y = f(x) in (5) describes the 
height of the track of of roller coaster. If a roller car is moving along the track with mass m and 
energy 


E 


2 

-mu 


+ mgy 


1 

2 


mv 2 + mg f(x) 


( 10 ) 


the time the car takes to get from x to x + dx is the infinitesimal path length, \Jd;x 2 -\- d:y 2 = 
dx ijl + f'(x ) 2 divided by the speed v which from (10) is 


v = 



mg f(x)^j jm 


( 11 ) 
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Thus the time the car takes to get from x\ to x -2 is 



\/ dx 2 + dy 2 
v 


rx 2 /L + f'{x) 2 

X1 \jz(E -mg f(x))/m 


( 12 ) 


The time (12) is a function of E, m and g. but it is a functional of the function f(x) that describes 
the height of the track. This is an important example to understand and we will try to come back 
to it several times over the next few weeks. 


Calculus of variations 

Now let’s return to Hamilton’s principle. The proof and indeed the more precise mathematical 
statement of the principle is an exercise in what is called the calculus of variations, which is sort of 
calculus with an infinite number of variables. It probably won’t surprise you that what we need to 
do is to generalize the notion of a derivative to functionals and to explore the nature of the Taylor 
series. After all, these are our main tools in understanding how things vary. 

The actual example we are interested in is complicated because the integrand on the right hand 
side of (3) depends on both x(t) and x(t), so let us first discuss a couple of simpler examples. 
Consider the quantity 

W[x] = J (x(t) — vt) dt (13) 

for some constant v and find the value of x(t) for which the variation of W [x] with respect to x(t) 
vanishes. We can do this by the following trick. Suppose that we have already found the soltuion 
x(t). Now consider small variations about this particular function: 

x(t) + 8x(t) (14) 

I’ve used the symbol 5 rather than A just to remind us that this is a “small” function rather than a 
small number. Now look at (13) as function of x + Sx, 

W[x + 5x\ = J (x(t) + Sx(t) — vt) dt 
= / (x(t) — vt) dt+ 8x{t) 2^x(t) — vt) dt + / (Sx(t)) dt (15) 

= W[x] + ^ 5x(t ) 2 (x(t) — vt) dt + 0(5x 2 ) 

Now we look at the coefficient of the linear term in 8x(t), which is 2 (x(t) — vt). This vanishes 
for all possible small functions Sx(t) if 


x(t) = vt (16) 

and this is the answer we are looking for. Around the function x(t) = vt, there is no linear term 
in the expansion of W[x\. Note that while t in (15) is a dummy variable, we can say something 
about the t dependence because we require that the variation vanishes for any possible dx(t). We 
will come back to this below when discuss functional derivatives. 
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This result is quite reasonable and maybe obvious in this particular example because W[x] 
is actually minimized for x(t) = vt. The variation of a functional at a minimum vanishes for 
the same reason that the derivative of a smooth function vanishes at a minimum. If there were a 
linear variation, then on one side or the other, the result would be larger, which is impossible at a 
minimum value. But what we want to extract from this calculation is not the result, but the general 
technique. 


Functional derivatives 


What we did to solve the variational problem for (13) was to set to zero the coefficient of the linear 
term in Sx(t) in W[x + 5x\. It is useful to give this coefficient a name, so we call it a functional 


derivative, and denote it by 

SW M 
Sx{t) 

" 6x(t) W ® 

(17) 

so in this case, we can write 

SW M 
8x(t) 

= 2 (x(t) — vt') 

(18) 


Notice that the t dependence of the RHS of (18) comes from the t dependence of the differential 
in the denominator of the LHS — again W[x\ for some particular x is just a number with no t 
dependence. 

This definition of functional derivative is generally useful, so let’s discuss it in general: 


5 

8x(t) 


W[x\ is the coefficient of the linear term in 8x(t) mW[x + Sx\. 


(19) 


This is a reasonable definition because it makes the functional Taylor series work in the same way 
that the ordinary Taylor series does. In terms of the functional derivative, the functional Taylor 
series starts like this: 


W[x + Sx] = W[x] + f* 8x(t) ^4- W[x] dt + • • • (20) 

Jti ox(t) 

In words, this says that the total change in the functional is obtained by adding up (actually in¬ 
tegrating because the variable is continuous) the small changes 8x{t) in x(t) at each point times 
the rate of change of the functional at that point. This is just what we always say, except that the 
number of variables has become continuously infinite. (20) is precisely analogous to the Taylor 
series for functions of several variables that we will discuss in the appendix and in future lectures: 

_^ 

F(x i + Axi,x 2 + Ax 2 , ■■■) = F(xi,x 2 , • • •) + H A Xj —F(x 1 ,x 2 , • • •) 4- (21) 

j ax i 

The only difference is that instead of the sum over all the different variables that we have in (21), 
in (20) we need an integral over t because in effect, the value of x(t) at each value of t is a separate 
variable. 
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Finding functional derivatives 


In general, functional derivatives can be difficult to find, but it is easy to find them for functionals 
like W[x] that have the form of an integral of an ordinary function — 

W[x] = f* F(x(t )) dt (22) 

In this case, the functional derivative of W[x] is related to the ordinary derivative of F — 

W) [x] = <23) 


The reason this is so simple is that we can calculate the functional Taylor expansion of W using 
the ordinary Taylor expansion of F, and looking for the coefficient of dx(t). Let’s show how this 
works. 

W[x + 8x\ = ^ F(x(t) + 8x(t)^j dt (24) 


But using the ordinary Taylor expansion for the ordinary function F(x), the right hand side be¬ 
comes 



F(x(t)^j + 8x(t) F'(x(tfj 



(25) 


Picking out the coefficient of Sx(t) on the right hand side gives (23). Notice the way that the 
variable t gets promoted from being a dummy variable in (22) and (24) to being a real variable in 
(25). We single out a particular value of t when we find the coefficient of Sx(t) for that t. 

The function F in (22) may depend on other functions of t, in which case we can generalize 
(22) as follows: 


5 

8x(t) 


J t F (x(t'),y(t'),--^dt' 


d_ 

dx 


F(x,y(t),---)\ x=x{t) 


(26) 


where ... denotes and other functions that F depends on. Note that in (26), t! is a dummy index, 
integrated over so that it could be called anything except t. The dependence on t is real (not 
dummy) because the process of functional differentiation picks out a value of t. Here are some 
examples: 


F[x] — f x(t') 3 dt' 

Jt i 



F[x\ — j sin (x(t')^ dt' 


fe(i) F[ll=C “( a:(<) ) 

F[x,y] = £ (x{t')y{t')Y d£ 



F[x, V\= J f sin (x(t') y(t ')) df 


£ 

8x(t) F ^ x ' y ^ = y ^ cos ( x ^y^) 


We can summarize these ideas by saying that when the functional is the integral of an ordi¬ 
nary function, what the functional derivative does is to eliminate the integral and differentiate the 
function. Think about it in the context of the examples in (27). 
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Back to Hamilton’s principle 


With this new tool of the calculus of variations, we can go back and consider Hamilton’s principle. 
Remember that what we want to do is to show that the functional S[x] has zero variation around 
the same trajectory that we get by using F = ma. 

According to our discussion, what we want to do to impose vanishing variation on S[x] is to 
set the functional derivative of S[x], 


5S 

8x(t) 


(28) 


to zero. As in the simpler example, we calculate the functional derivative by performing a func¬ 
tional Taylor series and picking out the coefficient of 8x(t). For pedagogical purposes, we will 
break this up into two pieces: 

S = S T ~S V (29) 


where 


and 


S T [a;] = jj T(x(tj) dt = j" ™(x(t)f dt 

" " rtf; 2 

v(x(t)^ dt — J v(x(t)^Jdt 


For the S v term, because V does not depend on x, this is just like the W[x\ example, 


(30) 

(31) 


5S V 

8x(t) 


= V'(x(t)) 


(32) 


This is a good sign because the derivative of the potential is related to the force. 


For the Sy, term, we will need an extra step, so let’s write out the functional Taylor series in 
detail. 



St[x + 8x] — J —(x(t) + 8x(t)^ dt 

(33) 


= J —(x(t)^dt + J 8x(t) mx(t) dt + 0(8x 2 ) 

(34) 


= <Sr[x] + f 5x(t) mx(t) dt + 0{8x 2 ) 

J 

(35) 

But the 8x(t) in (35) is the change in x(t) when we make a change 8x(t) in x(t). Thus since 



±(z(t) + 6z(t)) = m + 

(36) 

8x(t) is given by 

Sx(t) = —Sx(t ). 
dt 

(37) 
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The trouble with (35) is that it depends on 8x(t), while we have defined the functional derivative 
to be the coefficient of 8x(t). To take care of that, we can integrate by parts: 



8x(t) mx(t ) dt 


It dt 



d_ 

dt 


(8x(t) mx(t)^j dt 



8x{t) mx(t ) dt 



8 x(t) mx(t ) dt + 8 x{t 2 ) rnxit?) — 8x{ti) mx(ti) 


(38) 


The last line in (38) has two terms from the endpoints of the integration. These don’t look like 
everything else, and we want to get rid of them. This is where (2) comes in. We are supposed to 
be restricting our attention to functions satisfying (2). But if x{t) and x{t ) + 8x(t ) both satisfy (2), 
then 


8 x{ti) = 8 x(t 2 ) = 0. 


(39) 


In words, we are only interested in variations that vanish at the endpoints. These functions might 
look something like this 



So in fact, the funny looking terms vanish because of (2). Then (38) becomes 



8x(t) m x(t ) dt 



8x(t) mx(t ) dt 


(40) 


Now we can pick out the coefficient of 8x(t) and see that 

8St , 

= (41) 

Now, finally, we can verify that Hamilton’s principle is correct. Putting (32) and (41) together, 
we have that the condition for vanishing variation of S[x] is 


8S 

8x(t) 


— mx(t ) — V'[x{t)^j = 0 


(42) 
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or 

mx(t ) = —V'(x(t)') (43) 

which, since F = —V', is just Newton’s second law, which in this case, we know, is also the 
Euler-Lagrange equation. 

The Lagrangian and the action 

As we will see, Hamilton’s principle really captures more of what is going on in the world than 
F = ma. Let’s see consequences of Hamilton’s principle again in a more general language in 
terms of the Lagrangian, C. Again the action is 

S[x] = J £(x(t),x(t)^ dt (44) 

Here, x might actually have indices that allows it to represent more than one particle or dimension 
or both. We won’t write them explicitly in this formal derivation. Now we want the functional 
derivative of S[x\ with respect to x(t) to vanish. In general, the functional derivative is 

The first term arises from the Taylor expansion of the x(t) dependence. The second term arises in 
the same way as (41), from the Taylor expansion of the x(t) dependence, followed by an integration 
by parts, which gives the minus sign. Thus Hamilton’s principle implies that the solution for the 
motion satisfies the Euler-Lagrange equation(s) 

F_c( x (t),i(t)) - = 0 < 46 > 

If x has several components, (46) must be true for each component separately. 

Quantum mechanics and the classical trajectory 

In my view, the most important reason that Hamilton’s principle and the Lagrangian are so im¬ 
portant has to do not with classical mechanics alone, but with quantum mechanics and the way in 
which classical physics emerges as an approximation to the quantum world. 

Lrom the classical point of view, Hamilton’s principle is actually a little peculiar. Why should 
it matter to a classical particle what the value of the action is for paths that the particle does not 
actually take? OK - - so Hamilton’s principle works to give the classical equation of motion, but 
it is hard to figure out what it means physically. But in quantum mechanics, it has a very definite 
meaning, because in quantum mechanics, the particle really takes all paths! This is absolutely 
nutty, but this is really the way the world works. Roughly, the way it works is this. When a 
quantum mechanical particle moves from point x\ at time t\ to point x 2 at time t 2 , it takes all 
trajectories from the starting point to the end simultaneously. But the different trajectories can add 
together like the different ripples in a wave on a pond. Associated with each trajectory there is a 
complex number A whose phase is the action divided by h, Planck’s constant over 2n. 

e iS[x]/h ( 47 ) 
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The most likely trajectories are those that are near the classical trajectory, because the action is 
changing very slowly for these trajectories, they have approximately the action of the classical 
trajectory, and therefore all the As have the same phase and all the nearby trajectories add up 
coherently. Trajectories very far from the classical trajectory are unlikely because the phase is 
changing rapidly and nearby trajectories have different phases and add nearly to zero. 

There is another way of putting this that is perhaps more interesting. Euler-Lagrange equations 
are differential equation, but not every differential equation is an Euler-Lagrange equation. The 
fact that the world is quantum mechanical explains why the classical physics that we see can 
be described by the solutions not just to any old differential equation, but specifically to Euler- 
Lagrange equations. We will see why this distinction is important as we go along. 


Appendix: On the functional Taylor series 

It might be useful for some of you if I expand on the statement of the functional Taylor series 
because it a pretty piece of math. Some of you, on the other hand, may find this rather terrifying. 
Please DON’T PANIC. This section is called an appendix because it is completely optional. I 
won’t get to it at all in lecture. But for some of you it may be fun. 

The picture of the functional Taylor series that I will give you is related to the following state¬ 
ment that you may find useful. The t in a function x(t) can be thought of as a kind of index, 
labeling components of an infinite dimensional vector. Or to put this the other way around, a vec¬ 
tor r can be thought of as a function of the index that labels the component — ry is a number for 
each j just like x(t) is a number for each t. In this way of thinking, a functional, W[x] is like a 
function of several variables, f(r i, • • •, r n ) (depending on an n-dimensional “vector”), except that 
W[x\ depends on an infinite number of variables, the values of x(t) for some range of t. Now what 
does the Taylor expansion look like for a function of several variables, like f(r i, • • • , r„)? We can 
build up the Taylor series by looking at one variable at a time: 


/(ri+ «!,-• 

. ( d 

1 2 d 2 

' j r n) ~ 1 + - V 

\ or 1 

2 ai ^f + 

/(n + ai,r 2 + a 2 , • • 

x ? 8 

1 2 9 2 

' ) r n) — 1 + a 27:-h 

V or 2 

2 a ' 2 d ? 2 + 


and so on for all n variables 


f(r 1 + a 1 ,r 2 , ■ ■ ■ ,r n ) 


(48) 


This looks complicated, but it can be simplified easily using a very beautiful form for the Taylor 
expansion of one variable: 

/ f) \ 00 (1 ^ r/k 

g(x + a) = exp J g{x) = £ ^ g(x) (49) 

The expansion of the exponential precisely reproduces the terms in the Taylor expansion. Using 
(49), equation (48) becomes 


f(r 1 + ai, • • •, r n + a n ) 





(50) 
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But because the product of exponentials is the exponential of the sum of the exponents, this can be 
written as 

( n <9 \ 

f(r i + a u ■ ■ ■ ,r n + a n ) = exp I I /(ry, • • • ,r n ) (51) 

This is the Taylor series for more than one variable. It is rather neat that it looks just like the Taylor 
series for a single variable, (49), except that we have to include derivatives with respect to all the 
variables in the exponent. Now for an infinite number of variables labeled by a continuous variable 
t, in some range ti < t < U, the argument goes the same way, but instead of summing over the 
indices, as we do in (51), we must integrate over the continuous variable t, so that the functional 
Taylor series looks like 


W[x + 5x\ = exp 



W[x] 


(52) 


Note that if we expand the exponential, we get a first term which is just (20), as expected from 
our earlier discussion. 
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lecture 9 


Topics: 

Where are we now? 

Energy again 

Example - frictionless table 
When is F the energy? 

Symmetries and transformations 
Example: space translations for one particle 
Space translations for two particles 
Space translations for many particles 
Finding Symmetries 
Rotations 
Noether’s theorem 

Momentum conservation from Noether’s theorem 

More on rotations 

What functions are invariant? 

Example of functionals - Soap bubbles 

Where are we now? 

There is much more in these notes than we can possibly cover in class. But I thought I would 
put them all up and then decide what to discuss in detail after I see your QA responses tomorrow 
morning. 

We have seen how conserved quantities can arise as the generalized momenta associated with 
variables that do not appear in the Lagrangian. Here we will discuss a generalization of this fact, 
that is an even more important principle. In Lagrangian mechanics, continuous symmetries lead 
to conserved quantities. We have already seen one example of this in our discussion of energy, 
and we will begin by making the connection with time translation invariance. We will go on to 
discuss symmetries more generally, and also in more detail the specific example of space transla¬ 
tion symmetry, which leads to the conservation of total momentum. Because we will be talking 
about symmetry, at the risk of encouraging people to waste time, I have included a Mathematica 
file Kaleidoscope.nb that produces kaleidoscopic images with various different symmetries. I like 
it both because I like symmetry, and because it was so simple to construct with complex numbers 
(if only I could have worked in the Taylor expansion ...). 

Soap bubbles are a beautiful example of a functional minimization problem. The last sections 
discusses a simple type of soap bubble for which the analysis is the same as in Hamilton’s principle. 

Energy again 

Now we are going to do a little math that will lead to a remarkable and beautiful result. If the 
Lagrangian does not explicitly depend on time, we will find that we can construct a function of 
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the coordinates and the velocities of the system that does not change with time for any solution 
to the Euler-Lagrange equation. Often, we can identify this quantity with the energy. We will do 
this first for a single degree of freedom, and then extend the result to systems with more degrees 
of freedom. 

Consider a system with a single degree of freedom described by the Lagrangian 


C(q,q,t) 


( 1 ) 


for some single coordinate q, and construct the quantity 


dC „ 
F = g-yr - C 
dq 


( 2 ) 


In general, F may depend on q, q and t. Let us now ask how F changes with time, by taking the 
total derivative 


d d (. dC\ d 

dt dt \ dq) dt 

In the first term on the right hand side of (3), we use the product rule to write 


( 3 ) 


d ( .3C\ .. dC . d dC 

It { q Iq)~ q Iq +q ItIq 


( 4 ) 


In the second term on the right hand side of (3), we use the fact that the t dependence of C comes 
from the explicit t dependence, and also from the implicit dependence on t through q and q: 


d 

dt 


r ~ dC 
F = qjrr 
dq 


dC 

dq 


dC 

~dt 


( 5 ) 


The relation (5) is an example of one of those multivariable calculus things we have talked about 
before that will make your eyes glaze over if you just stare at the symbols. But if you translate it 
into words, it makes perfect sense. It says that the total rate of change of F with t is the rate of 
change of q times the rate at which F changes with q plus the rate of change of q times the rate 
at which F changes with q plus the rate of change from the explicit time dependence. It is simply 
a matter of adding up all the possible sources of time variation of the function F. Subtracting (5) 
from (4) and using the Lagrange equation 


d dC _dC 
dt dq dq 


we get 


d p- _ dC 

dt dt 


( 7 ) 


Thus if C does not depend on time EXPLICITLY, but only implicitly through the time dependence 
of q and q, the function F is constant for the trajectory. 

It is important to understand what is meant here by the words “explicit” and “implicit”. Explicit 
time dependence occurs only if there is some physics in the problem that changes with time. On the 
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other hand, ANY function of q and q depends implicitly on time, because q and q for the trajectory 
depend on time. 

Usually, this function F is the Energy! For example, suppose we look at the Lagrangian for a 
particle moving in a potential 

C(x, x) = ^ x 2 — V(x) (8) 

For this Fagrangian, the function F is 


d 777, 

x —jC(x, x) — C(x, x) = xmx — C(x, x) = — x 2 + V (x) 

C/X Zj 


(9) 


which is the energy, as promised. 

For example, in the example we discussed last time of the bead on the horizontally rotating 
rod, where the Fagrangian is 

£(£,£) —-m(J. 2 + £ 2 cu 2 ^j (10) 

the construction of (2) gives 

F = im£ — ^ m(i 2 + f 2 cu 2 ) = ^ m (£ 2 — f 2 cu 2 ) (11) 


These two examples are rather different. In the second, the fact that the Fagrangian has no 
explicit time dependence is an accident, arising from the cancellation of the ut dependence that we 
talked about earlier. This is related to the fact that F in this case is not the kinetic plus the potential 
energy, but rather a curious combinations of the terms in the kinetic energy. It is conserved, but the 
physical interpretation is obscure. 

In the first example, however, the particle moving in a potential, the Fagrangian does not de¬ 
pend explicitly on t because there is a symmetry of the system. The symmetry in this case is time 
translation invariance. In this case, F really is the physical energy. This is the first of several impor¬ 
tant examples we will see in this course of the connection between a symmetry and conservation 
law. We will explore this further next week. 

For a vertically rotating rod, C depends on time explicitly because of the factor of sin u>t in the 
potential energy, F is not conserved. 

The construction of the energy function makes it clear how to deal with more degrees of free¬ 
dom. If q has an index, q :i where j goes from 1 to n, the analogous construction for the function F 
is 

F = E ii tr - £ < 12 > 

It must have this form for the analog of (3)-(7) to be valid, because the analog of (5) for more 
degrees of freedom is 


d r ST - dC j. sr ■ 


d qj 


dC 

dt 


(13) 


j ^ j 

This requires that the same sum over degrees of freedom appears in the first term in (12). With F 
defined as in (12), (7) is still valid. 
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Thus the construction (12) is very general. In fact, it reproduces the usual expression for the 
energy as the kinetic plus the potential energy whenever the potential depends only on qj and 
the kinetic energy is proportional to two powers of the velocity. However, it is really even more 
general than that. For any Lagrangian that does not depend explicitly on t, (12) defines a conserved 
quantity. And if the explicit t dependence vanishes because of time translation invariance, the 
conserved quantity is the energy. 

Example - frictionless table 

Here is an example of a Lagrangian for a system with two degrees of freedom. Consider a friction¬ 
less table in the x-y plane with a hole at the origin. A mass rxi\ slides on the surface of the table, 
but it is attached to a massless string of length R which goes through the hole in the center of the 
table and hangs straight down where it is attached to a mass m 2 . We can describe the configuration 
of the system by giving the length, i, of string on the table and the angle, 6, of the string on the 
table from the x axis. From above, this looks like this: 
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From the side, it looks like 



The kinetic energy contains a term from the rate of change of i, proportional to the sum of 
the masses (because both masses move when t changes), and a term from the rate of change of 6, 
proportional to mi — 

T(£,9,£,9) = ^(mi + m 2 )f 2 + (14) 
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doesn’t depend on 9. Thus the generalized momentum corresponding to 9, 


—r = mil 2 9 (20) 

o9 

is constant. This conserved generalized momentum is another example of angular momentum 
about the origin, which is conserved because the system has a rotation symmetry about the origin. 
For this system 

C(£,9,£,9) = ^(mi + m 2 )£ 2 + ]mii£ 2 9 2 - m 2 gl (21) 


there is no explicit time dependence, so we expect a conserved energy. Because there are two 
degrees of freedom, we have to use the construction (12), which gives for the conventional energy, 
T + U, 


F — £ (mi + m 2 )£ + 9 rrii£ 2 9 — Emi + m 2 )£ 2 


-mil v + m2gi 


- (mi + m 2 )£ 2 + -mi£ 2 9 2 + m 2 gl 


( 22 ) 


When is F the energy? 


To even get started on this, we have to restrict ourselves to situations where there are no frictional 
forces, and therefore nothing is converting kinetic and potential energy into heat. Since heat is just 
kinetic energy of random particle motion, this really just means that we need to look at a system at 
a sufficiently fundamental level to see all the kinetic energy explicitly. For now we will take care 
of this by restricting ourselves to systems described by Lagrangians of the form T — V. Then the 
first answer is that F is the energy whenever the kinetic energy T is quadratic in the velocities, q 
and the potential energy V does not depend on the velocities. The reason is that the differential 
operator 

z w~ (23) 

oz 

counts the degree in z, because 


z^- z n = znz n - 1 = nz n 
oz 


(24) 


That is, this operator acting on a term that is some power of z just gives the term back multiplied 
by the power. When we sum over all the velocities, the differential operator 


E Qj 


d_ 

dQj 


(25) 


adds up the powers of all the velocities. When this acts on a function quadratic in the velocities, 
every term in the function just gets multiplied by 2. For example 




d_ 

dqj 


•2 . d , 2 

Qi =9 i « t - Qi 
oqi 


2 qi 


(26) 
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and 


(27) 


v- . d . . . d . . . d . . 

Qj w^QiQ- 2 = qiwr- q i q-2 + q2^r- q\ q2 = gi <?2 + q\ q2 = 2 <?i g 2 
j dqj d('i\ oq 2 

Thus if T is quadratic in the velocities and V is independent of the velocities, 




(28) 


and then 

dC 

F = Y q,- - C = 2T - (T - V) = T + V = E (29) 

T dq i 

and as promised, F is the energy. 

At the fundamental level, the kinetic energy is always quadratic in the velocities, because it is 
just a sum over all the parts of the system of \mv 2 . But what can go wrong with this can be seen 
in examples like the bead on a rod rotating with fixed angular velocity u. Sometimes, we have to 
separate the variables that describe the system into those that are dynamical - like the position of 
the bead along the rod - and those that are imposed by some “external” constraint - like the angle 
of the rod, which is fixed whatever is causing the rod to rotate. Then we include in our list of 
coordinates only the dynamical coordinates and put the effect of the others into the Lagrangian by 
hand. Then if our external constraint is causing the motion, some of the velocities that appear in 
the kinetic energy do not correspond to dynamical coordinates in our Lagrangian, and they are not 
included in the sum, 

qj a- ( 3 °) 

j Q qj 

In such a case, (29) is not correct, and F is not the energy. 

In a system with time translation invariance, however, there can be no time dependent external 
constraint. Such a thing would look different at different times and break time translation invari¬ 
ance. Thus time translation invariance does two things. Not only does it ensure that the Lagrangian 
does not depend explicitly on time - which implies that F is conserved. But is also ensures that the 
kinetic energy is quadratic in the dynamical velocities, which ensures that F is the energy. In fact, 
time translation invariance is the more general answer to the question. Even if C is not in the form 
T — V, time translation invariance implies that F is the energy. 


Symmetries and transformations 

What is a symmetry? We have talked about several examples, so perhaps we should define pre¬ 
cisely what we mean by it. Symmetry is a mathematical statement of some very specific regularity 
in a system. A system has a symmetry if there is some transformation you can make that leaves the 
system looking exactly as it did before. We tend to regard things with many symmetries as pretty, 
like the kaleidoscope that we saw at the beginning of lecture, which has many planes of symmetry. 

In the case of mechanics, we have an even more specific meaning in mind. Let us now consider 
a class of symmetries in which we make some transformation of the coordinates describing a 
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system at a fixed time. What this means mathematically is that we define a new set of coordinates 
as functions of the original coordinates. The transformation is then a symmetry if the physics looks 
exactly the same in terms of the new coordinates as it did in the old coordinates. 

We talked briefly about such a transformation when we discussed the double pendulum with 
two equal masses in lecture 2. The Lagrangian for the double pendulum for small oscillations 
looks approximately like 

y + xl) - J {x\ +4) - y (xi -x 2 ) 2 (31) 

This has the property that it is unchanged if we interchange x\ and x-i- This is the mathematical 
statement of the obvious physical symmetry of the system. 

The symmetry of the double pendulum is an example of a discrete symmetry, so-called because 
the symmetry is an all or nothing sort of thing. The transformation cannot be made bigger or 
smaller - it is fixed by the structure of the symmetry. 

It is more even interesting to consider symmetries in which the symmetry transformation can be 
made arbitrarily small. Such a thing is called a continuous symmetry, because the transformation 
can change the system continuously. By putting arbitrarily small transformations together, we can 
get a whole set of transformations which, unlike the symmetry of the double pendulum, depend on 
a parameter that can be continuously varied. 

An example is translations. We think that space probably looks the same everywhere, and we 
could describe this by saying that there is a symmetry in which we move everything by the same 
arbitrary vector and we would end up with a completely equivalent physical system. 

Here is the general theoretical setup (we’ll discuss examples in more detail shortly). Consider 
a system of n degrees of freedom described by coordinates qj for j = 1 to n. Let’s assume 
that there is a symmetry in which each of the coordinates changes only a tiny bit, proportional 
to an infinitesimal parameter, e and that the changes involve only the current configuration of the 
system. What we mean precisely by infinitesimal is that e is sufficiently small that we can always 
ignore terms of order e 2 . Translating what we have just assumed into mathematics, we consider a 
symmetry in which the coordinates q :i are transformed as follows: 

Qj -»• Qj = Qj +£K qj (q) . (32) 

That is each of the coordinates changes by e times a function n q . ( q ) of the qs. The K qj ( q ) tells you 
how the variable qj changes under the transformation. The transformation (32) is a symmetry of 
the Lagrangian if 

£(q,q) = £{qA) (33) 

Example: space translations for one particle 

Here is a simple (perhaps even boring) example. Consider a particle with mass m moving along 
the x-axis in a potential. When does this system have a symmetry under the infinitesimal transfor¬ 
mation 

x —> x = x + eQ (34) 
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( 35 ) 


This transformation has the form of (32) with n(x) = 1. The Lagrangian looks like this: 


C(x,x) = -mx 2 — V(x) 

The condition that (34) is a symmetry is then 


C(x, x) = C(x, x) 

(36) 

Because k(x) is just a constant, we have 


d 

X = —x = X 

dt 

(37) 

and so (36) becomes 


C{x + e, x) — C(x, x) 

(38) 

for infinitesimal e. Because the x doesn’t change and kinetic energy is the same on both sides, so 

this condition only effects V - 


V{x + e) = V{x) 

(39) 


But because this is supposed to be true for any infinitesimal e, we can use the Taylor expansion 
(surprise, surprise) to rewrite (39) as 


V(x + e) = V(x) + e V'(x) + 0(e 2 ) = V(x) (40) 

If this is to be satisfied for infinitesimal e, we must have 

V\x) = 0 (41) 

so that V(x) is just a constant and the particle has no force on it at all. In this case, mv is a con¬ 
served momentum. We will see how this connection between symmetry and conserved momentum 
generalizes to more complicated (and more interesting) situations. 

Another reason that I wanted to look at this simple system in detail is to emphasize the dif¬ 
ference between continuous and discrete symmetries. Suppose that instead of being constant, the 
potential in (35) is 

V(x) = —E 0 cos(x/£) (42) 

where E 0 and i are constants. This system also has a symmetry under space translations of the 
form 

x —> x = x + 2nn£ (43) 

for any integer n. But here we clearly cannot conclude that V (x) is constant because we started 
with an example with the symmetry that is not constant. Except at special points where the particle 
is in equilibrium, there is a force on it. There is no conserved momentum (though energy is still 
conserved because the Lagrangian does not depend explicitly on t ). The difference between this 
and the previous example is that this is a discrete symmetry. The changes in x that leave the 
Lagrangian invariant are a discrete set. They cannot be varied continuously, and they cannot be 
made infinitesimally small. Thus we cannot use the Taylor expansion argument to conclude that 
V(x) is constant. 
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Space translations for two particles 

Space translation symmetry becomes interesting and important when there is more than one par¬ 
ticle. Let us now consider a one-dimensional system of 2 particles, with positions x\ and x 2 , so 
that qj = Xj for j = 1 to 2. A space translation in the x direction just adds the same infinitesimal 
constant, e, to both x\ and x 2 — so the transformation has the form 


X\ —> X\ = Xi + e , x 2 —> x 2 = X 2 + e . (44) 

Notice that this satisfies (32), with q\ = xi, q 2 = x 2 and k Xi (. x ) = k X2 (x) = 1. This is a symmetry 

of any Lagrangian that depends only on x\ and x 2 and the difference between X\ — x 2 , for example 

C{x,x) = ^ ±1 + ^ x 2 2 - V{ Xl - x 2 ) (45) 

The transformation (44) is a symmetry of the kinetic energy because e is a constant, so that 

xi — Xi , x 2 = x 2 . (46) 

It is a symmetry of the potential energy because the es cancel when we subtract one coordinate 

from another, so that 

Xi- x 2 = (xi + e) - (x 2 + e) = x x - x 2 . (47) 

Putting (46) and (47) together implies 

C(x,x) — C(x,x). (48) 

This system has a conserved momentum, 

p = miXi + m 2 x 2 (49) 

because the forces that come from the potential energy obey Newton’s third law. This in turn is 
related to the fact that the potential energy depends only on x\— x 2 , which in turn is related to the 
symmetry. We will see that this connection between a continuous symmetry and the existence of a 
conserved momentum is a general thing. 

Space translations for many particles 

Let us now consider a one-dimensional system of n particles, with positions x r so that q y = x 3 for 
j = 1 to n. A space translation in the x direction adds the same infinitesimal constant, e, to each 
Xj — so the transformation has the form 

Xj —> Xj = xj + e Vn . (50) 

Notice that this satisfies (32), with qj = x 3 and all of the k x (x) = 1 . This is a symmetry of any 
Lagrangian that depends only on x 3 and differences between two XjS, for example 

C(x,x) = - V(x!-x 2 , x 2 -x 3 , • • • , Xj-X j+ 1 , • • • , x n -i-x n ). (51) 
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Again, (50) is a symmetry of the kinetic energy because e is a constant, so that 

Xj = Xj . (52) 

It is a symmetry of the potential energy because the es cancel when we subtract one coordinate 
from another, so that 


Xj - x j+ 1 = (xj + e) - (x j+ i + e) = Xj - x j+ i. (53) 

Here the total momentum, 

( 54 ) 

3 

is conserved because the forces that come from the potential energy obey Newton’s third law. 


Finding Symmetries 

So far, we have looked at systems in which it is pretty obvious what the symmetry transformation 
is. But when come upon some Lagrangian, you may want to find what transformation the La- 
grangian is invariant under. The way to do this is to write down how your Lagrangian transforms 
under a general infinitesimal transformation of the form 

Qj -»■ Qj = Qj + e«<fc(e) • (55) 

and then require that it be invariant - that is that the coefficient of the e term in 

£(q,q)-£(q,q) (56) 


vanishes. Here is a simple example, where you could probably guess the answer, but it will illus¬ 
trate the technique. 

C(x,x) = ^ x\ + - V(xi + 2x 2 ) (57) 

Does this Lagrangian has a symmetry? To see, we want to find C(x, x) where 


Xi 


Xj = Xj + e k x (x) 


(58) 


m i 


£(x, x) = — xi + ek Xl + — [x 2 + ek 


m 2 


V X 2 


V (xi + 6K, X1 + 2x 2 + 2 6K X2 ) 


— (x'i + 2,exik xi + • • + -j- ^2 + 2ex 2 k X2 + • • —V + 2x 2 + e(n xi + 2/ta ;2 )^ 


The e term will vanish if 


k x i kx 2 i^x i 2,k X2 0 


(59) 

(60) 

(61) 


One way to satisfy the first two conditions is to take the ks to be constant independent of x (if they 
depend on x, the time derivatives would be nonzero). Then the last condition implies that we can 
take 

kx, 2 k X2 1 (62) 
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which gives a transformation of the form 


x\ —> x\ — 2e ^2 —^ ^2 + e (63) 

You see that it is sometimes easy to find symmetry transformations. It is harder to show that you 
have found them all. This depends on details, in this case the precise form of the function V. We 
won’t talk about that now. 

Rotations 

Another simple example can be found in the example we discussed earlier of the mass mi sliding 
on a horizontal frictionless table, connected to a string that goes through a hole at the origin to 
a mass m 2 hanging below the table with Lagrangian (16). This physical system has a symmetry 
under rotations about the z axis, which add a constant to the angle 9 without changing i, 

t -> l = 1 7 9 ^9 = 9+ e. (64) 

This symmetry is responsible for the fact that the Lagrangian does not depend on 9 at all, but only 
on 9, because the condition for symmetry, that 

c(i,e,e,o) = £(e,9,i,e) (65) 

in this case becomes 

C(£,9+ e,l,9) = L{£,9,i,9) (66) 

This just says that if we make a little change in 9 in the function, nothing happens, so the function 
must not depend on 9. If we wanted to say the same thing in fancier mathematics, we could say 
that because e is infinitesimal, we can reliably Taylor expand the left-hand-side of (66) and keep 
only the first two terms, 

C{£, 9+ e,l,9) = C{£, 9,1,9)+e 4-£(f, 9, £, 9) (67) 

Ou 

Putting (67) into (66) implies 

^C{e,9,i,9) = 0 ( 68 ) 

which as promised is the statement that C does not depend on 9. 

As we saw in lecture 7, this system also has a conserved quantity, the angular momentum 

L = mi£ 2 9 . (69) 

So again, we see a connection between a symmetry and a conserved quantity. 
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Noether’s theorem 


Having seen a correlation between symmetry and conservation laws in a couple of examples, let 
us now consider how this works in more generality and see if we can pin down the connection 
precisely. Putting (32) into (33) and Taylor expanding gives 


£(<?, q) = C(q + eK,q + ek) = C(q, q) + e ^ 


dC 


K, 


Qj 


qj •' ’/./ qj ^ ( ii 

Thus the condition that (32) is a symmetry of the Lagrangian is equivalent to 


dqj 


£ E % 


2> 

Qj 


dC _ dC 
dqj + ^ Kqj d'U 


(70) 


(71) 


For example, for space translations, (50), 


K 


= 1 


(72) 


for all j, because all the xj s are translated in the same way. Thus k 
symmetry becomes 


E 




dC _ ^ dC 

dXj dxj 


0 and the condition of 
(73) 


This is equivalent to the statement that C depends only on differences Xj — X/,. 
Now return to the general case and consider the quantity 


E 




dC 

dqj 


Consider the time derivative of this quantity 



If we apply the Lagrange equations of motion to the second term, it becomes 


d 

dt 



= E 


hi. 


<lj 




dqj 


^ dC n 


(74) 


(75) 


(76) 


which vanishes because of the condition (71) that the Lagrangian is symmetric. This is a very 
important general theorem. It is the precise connection between a continuous symmetry and a 
conservation law. For every continuous symmetry of the form (32), there is a conservation law — 
the quantity of the form (74) is constant. 

Now we can find the conserved quantity for any symmetry. For example for the Lagrangian of 
(57), where the ks are given by (62), the conserved quantity is 


k 7 


dC 

dx i 


k 7 


dC 

' dxo 


—2 miXi + m 2 x 2 


(77) 
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This theorem is a special favorite of mine because a lot of my own work in particle physics 
is based on it. It was worked out early in this century by the great woman mathematician and 
theoretical physicist Emmy Noether. 1 
The quantity 


dc 

dqj 


(78) 


that appears in Noether’s theorem is a very important one in classical mechanics. It is the “gen¬ 
eralized momentum” we talked about earlier, corresponding to the coordinate q v we discussed in 
the last lecture. Of course, if q 3 is a normal space coordinate (like x 3 ), and the kinetic energy has 
the standard form, it is the momentum (or a component of it). In fact, one way of thinking about 
(76) is to recognize that in a sense (which I will not explain in detail) (74) is the generalized mo¬ 
mentum associated with, e, the infinitesimal variable that describes how all the q 3 s change under 
the symmetry. 


Momentum conservation from Noether’s theorem 


For space translation symmetry, because all the n Xj are equal to 1, the quantity (74) becomes 


5Z K * 


dC 


_ V — 

3 dij ■“ d±j 


(79) 


which is just the sum over the momenta of all the individual particles. Space translation invari¬ 
ance implies that the total momentum is conserved. 

This analysis can obviously be extended to three dimensions, where the coordinates and the 
corresponding momenta become vectors. 


More on rotations 

Here is a more involved example that we can discuss if we have time. When we thought about 
rotations for the bead sliding on the table, we were already using polar coordinates. This makes 
rotations easy, because an infinitesimal rotation is just a translation of 6 , 

e 6' = 9 + e. (80) 

This is an example with two particles in Cartesian coordinates. Consider two particles, with masses 
mi and m 2 , which move in a plane with coordinates f) = (x\, y\) and r 2 = (x 2 , y 2 ). Consider the 
following Lagrangian: 

(x'\ + yf) + ^rn 2 (x 2 + y 2 2 ) - V (x 2 + y\ + x\ + y\ - 2(xia; 2 + yiy 2 )) (81) 

'For more information about Noether, see The Life and Times of Emmy Noether by Nina Byers — 
http://xxx.lanl.gov/abs/hep-th/9411110 and on the 16 website in handouts/noether.pdf. 
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This Lagrangian is invariant under the following infinitesimal transformation: 

Xi^Xi = Xi - eyi 
x 2 -> x 2 = x 2 - ey 2 

If you have never seen anything like this before, it is not obvious at all, but you can see it by explicit 
calculation. For example, look at the mi terms in the kinetic energy. First note that 

xi -> xi = xi - eyi yi^yi = yi + ex! ( 83 ) 


yi^yi = yi + ear 
y-2 ~ 1 ” y2 — y2 + 1%2 


( 82 ) 


then substitute 

x'i + y\ = (ii - eijij 2 + (yi + cxi ) 2 = x{ + y x 2 + C>(e 2 ) (84) 

The e 2 terms do not cancel, but we don’t care about them because e is infinitesimal - we only 
consider linear terms in Noether’s theorem. So this is good enough. All the other terms work 
similarly. You can see what the rotation looks like pictorially in the figure below. 



The heavy lines represent the vector r — (x. y) from the origin to the original point, and the smaller 
vector from r to f = (x. if). Because e is infinitesimal, these two are nearly perpendicular, and the 
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ratio of their lengths is about e. It is helpful to complete these two lines into a similar triangle. 



Then blowing up the box, you can see that the change in x under the rotation is —ey and the change 
in y is ex. 
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( 86 ) 



Thus the conserved quantity is 


(87) 


E m A x jVi - VjXj) 

j 

As we will see in more detail later in the course, this is the angular momentum. 

Notice also that the complicated function of the variables that appears in V can be written as 

(f i - r 2 ) ■ (fi - f 2 ) = |fi - r 2 | 2 (88) 

just the square of the length of the difference fj — r 2 . 

What functions are invariant? 

We have seen some examples of functions that are invariant under transformations, such as the 
potential in (51), 

V (X!-X 2 , x 2 -x 3 , • • • , Xj-X j+ 1 , • • • , Xn-X-Xn) . (89) 

It always straightforward to check that a given function is invariant under a specific transforma¬ 
tion. And we have seen how to find transformations that are symmetries, if they exist. It is also 
important to be able to go the other way and to construct the most general function invariant under 
a transformation or set of transformations. 

To illustrate what I am talking about, let’s show that a function of F(xi,x 2 ) that is invariant 
under the symmetry transformation 

Xj —» Xj + e (90) 

for j = 1 and 2 actually only depends on the difference, x\ — x 2 . One way to do this is to change 
variables to include the variable X\ — x 2 , and eliminate X\ — 

y = x i-x 2 xi = x 2 + y (91) 

Then we can define a new function 

G(y,x 2 ) = F(x 2 + y,x 2 ) (92) 

in terms of the original function. But now, in terms of the new variables the transformation (90) is 

y ^y , X 2 -> x 2 + e. (93) 

Now since F is invariant, G must be also, because we have only relabeled things. Thus 

G(y,x 2 +e) = G(y,x 2 ) (94) 

for infinitesimal e. In words, this says that making an infinitesimal change in x 2 doesn’t affect the 
function, so it is probably obvious that this means that G(y, x 2 ) is independent of x 2 . But if we 
want to be more formal about it, we can use the Taylor expansion to get 

G{y, x 2 + e) = G(y , x 2 ) + eJ^-G(y, x 2 ) 4-= G(y, x 2 ) (95) 

UX 2 
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and thus 


~^G(y,x 2 ) = 0 (96) 

which means that G doesn’t depend on x 2 , so we can take x 2 to be anything in G. Thus using (91) 

F(x 1 , x 2 ) = G(x 1 - x 2 , x 2 ) = G(x 1 - x 2 , 0) = f(x 1 - x 2 ) . (97) 

It is easy to extend this proof to show that the most general function invariant under the transfor¬ 
mation (90) for n variables, x :) for j — 1 to n is given by (89). 

Notice that what we are doing here makes Noether’s theorem seem a little trivial. If all we 
do with invariance is to show that there is some variable that the function doesn’t depend on, 
then we could have changed variables first and then found the conserved quantity by just using 
the statement that the generalized momentum associated with a variable that doesn’t appear in 
the Lagrangian is conserved. And in fact, in this course, you can always do that. But there is 
actually more to Noether’s theorem, because transformations that leave a system invariant have an 
additional interesting property. They form what mathematicians form a group. The group property 
is quite powerful and often allows you to extend the infinitesimal transformations that we start with 
to a much larger set. For example, in the case of translations, group theory can be used to show 
that if things are invariant under (90) for infinitesimal e, they are also invariant for finite e. Then 
we don’t need the Taylor expansion any more. For example, in (94), we could first set x 2 to zero 
to get 

G(y,e) = G(y, 0) (98) 

and then relabel e —» x 2 to get 

G(y, x 2 ) = G(y, 0) (99) 

which directly gives (97). We won’t pursue this approach much in the course but I may not be able 
to restrain myself from talking more about it, because it forms the basis of much of the research 
that I have done in my scientific career. 

Example of functionals - Soap bubbles 

I thought that it would be interesting to discuss further the pretty mathematics of calculus of vari¬ 
ations in a slightly different context. I hope this may give you a better feel for it. There are many 
examples of the use of the calculus of variations. One nice one is to the shape of soap bubbles. The 
connection here is that soap bubbles have a surface tension, so they want to minimize their surface 
area. But the surface area is a functional of the function that describes the shape of the bubble. So 
this is a job for the calculus of variations. Here’s a specific problem that is kind of fun. Consider a 
bubble formed between two circular loops of wire, both centered on the z axis, one in the z — z\ 
plane with radius rq, and the other in the z = z 2 plane with radius r 2 . We’ll assume that z\ < z 2 . 
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So it looks something like this: 


z 

A 






( 100 ) 


The soap bubble will be some surface of revolution, because of the cylindrical symmetry of the 
system , indicated by the dotted line in (100). Thus we can specify the shape of the bubble by 
giving the radius as a function of z, r(z), subject to the constraint 


r(zi) = r\ r(z 2 ) = r 2 


( 101 ) 


which is just the physical requirement that the bubble is attached to the frame — but it looks just 
like the condition we impose on trajectories in Hamilton’s principle. Thus we can compute the 
shape exactly by solving an Euler-Lagrange equation. 

We want to minimize the area of the surface of revolution described by r(z) from z — Z\ to 
z = z 2 subject to the constraint (101). First we must compute the area. Consider the area of a small 
band of surface between z and z + dz. The circumference of the band is approximately 2nr(z) 
(the difference between r(z) and r(z + dz) does not matter here). The width of the band can be 
computed by looking at a slice through the z axis, which looks like this: 


r(z + dz) ~ 
r(z) + r'(z) dz 



z + dz 

z 


( 102 ) 


Evidently, the width of the slice is approximately 


\/dz 2 + dr 2 = dz yj 1 + r'(z) 2 


(103) 
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Thus the area of the slice is 


dza(r(z),r'(z)^ where a(r, r') = 2nr Vl + r' 2 


(104) 


and the total area is 


A[r] 



a 


(r( 


z , r (z 


dz 


27T f r(z) J 1 + r'(z) 2 dz 

J Z\ 


(105) 


Now we can use the calculus of variations. The mathematics of this problem is the same as in 
finding the vanishing variation the action, S[x], so the condition is the analog of the Lagrange 
equation, 


SA[r 

Sr(z) 


= 0 


(106) 


We can do the functional differentiations in the same way as above to the analog of the Lagrange 
equation — 


i a( - ry) ~ Tzt a(ry)=0 <107) 

But rather than trying to solve the Euler-Lagrange equation directly, it is useful to extend the 
analogy with Lagrangian mechanics and notice that because a does not depend explicitly on z, 
there will be an analog of energy that is independent of z: 


dC , da 

q — - C^r — - a 
oq or' 


(108) 


, da 


, <9 


r -- a — r —— ( 2ir ry 1 + r 

or dr 


- (^2irr\J 1 + r' 2 ^ — 2nr\J 1 + r' 2 


= 2nr 


= r' 2irr 
/ r /2 


+ r ' 2 


2nr v 1 + r 


/2 


/ 2 


Vl + r ' 2 


1 + r 
Vl + r ' 2 


2nr 


Vl + r ' 2 


r 


Vl + r ' 2 


= r 0 for some constant r 0 


We could mess around with this — but it easier to guess the answer. 


r(z) = r 0 cosh 


r'(z) = r n — sinh — = sinh — 

v ' u r r o r 0 

Now the key step that might actually cause you to make this guess is the next one. 

1 + r'(z ) 2 = 1 + sinh 2 — = cosh 2 — 


Vl + r ' 2 


r o 


rncosh — 
_ lo_ _ r 

cosh — 
ro 


r o 


(109) 

(HO) 

(111) 

( 112 ) 

(113) 

(114) 

(115) 

(116) 
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In the demo I will show you, both rings have the same radius, ry = r 2 = R. Then we define the 
z coordinate system so that z = 0 is right in the middle. Then the fact that cosh(x) = cosh(— x) 
implies that b = 0 so that 

r(z) — r 0 cosh(z/ro) (117) 

and the situation looks like the diagram below (the red lines represent a side view of the rings and 
the blue curves are a slice through the bubble): 



One can analyze this analytically, but it is complicated, and since we have Mathematica around, 
we may as well just find an numerical solution for r 0 as a function of the ring separation Z for 
fixed ring radius R. This is shown below and in the notebook soapbubble.nb. 

1.0 

0.9 

0.8 

0.7 

0.6 

0.0 0.1 0.2 0.3 0.4 0.5 0.6 (1 18) 

Note that something happens if you try to make Z/R big. The maximum is ~ 0.6627 R, for which 
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the bubble looks something like the blue curves: 



For large Z/R (as you can see analytically in the appendix), there is no real solution for R/r 0 , 
which means that we cannot actually minimize the area in this way. Physically, what is going on 
is that for large z it is always energetically favorable to just keep narrowing in the center until 
the bubble actually breaks into two separate bubbles on the two rings. We should be able to see 
this in the demo. You can also get a sense of what is happening by looking at soapbubble.nb, 
increasing Z/Rio its maximum value, and then clicking on color 0, which toggles diagonal lines 
that represent a cross-section of the cone through the two circles. The system is unstable when the 
bubble gets beyond that cone. 

A Analytic treatment of the bubble 

Now 

R/r 0 = cosh(Z/r 0 ) = cosh (^(R/r 0 )(Z/R)j (119) 

is a transcendental equation that can be solved numerically for ciR in terms of Z/R. Because, from 
(117), 

r(0) = r 0 (120) 

the physical interpretation of r 0 is that it is the radius of the bubble at its narrowest point, in the 
center. When we are finished messing with the math, we will express things in term of r 0 /R, 
because that is easier to think about physically. 

It is actually much easier to solve for Z/R in terms of R/r 0 , because (119) can be rewritten as 
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a quadratic equation for e^ ro ^ z ^ R ^ 


R/r 0 = cosh ((R/r 0 )(Z/R)) = ( e (fl/ro)(z/fl)) + e ~W r o)(z/R))j / 2 
e 2 (R/ r o){z/R) _ 2 r/ r 0 e W r o)( z / R ) _)_ x = 0 


( 121 ) 


( 122 ) 

(123) 

(124) 


o. 
o. 

0 . 

0. 

0 . 

0 . 

r 0 /R 



with solution 



Expressing this in terms of r 0 we have 

| = ^ln(i?/r„ + VWr„) 2 -l) 
This shows a plot of Z/R versus r 0 /-R 


The first thing to notice about thus graph is that there are two possible values of r 0 /R for each 
values of Z/R. Physically, this doesn’t make sense. We can understand what is going by looking at 
the ends of the r 0 /R axis on the first graph. The rightmost point on the graph corresponds to Z = 0 
and r 0 = R which is the right solution because in the limit of very small Z the area is obviously 
minimized by an almost flat ribbon between to two rings. Conversely, the leftmost point, z — 0 
and r 0 = 0 is obviously not a physical solution. While the variation vanishes there, the area is 
not minimized. The leftmost point is some kind of saddle point. Now as we go up along the Z/R 
axis, the same situation obtains. The point on the right corresponds to the minimum. Thus the 
physical region of the first graph is only the part to the right of the maximum, which occurs for 
r 0 = t min ~ 0.5524 R. 
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Now that we know what is happening, we can display just the physical solution by plotting 
r 0 /R versus Z/R from r min / R to 1. 
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lecture 10 


Topics: 

The structure of science and common sense 

The speed of light 

Time dilation 

The twin paradox 

The Doppler effect 

The twin paradox and inertial frames 


The structure of science and common sense 



from www.raremaps.com 

The beautiful edifice of Newtonian mechanics, which we have seen a bit of in the last few 
weeks, provides a wonderful precise mathematical description of most of the things we see in our 
everyday world. It is obviously right. But it is also wrong. We have discussed qualitatively the 
underlying quantum mechanical reality from which Newton’s mechanics emerges as an approx¬ 
imation. In the next few weeks, we will discuss in quantitative detail the bizarre things that go 
wrong with Newton’s picture at large velocities. What is going on here? Newtonian mechan¬ 
ics beautifully captures the mathematical essence of what we know about the world in which we 
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have grown up. Once we get used to the mathematical language, it is perfectly in accord with our 
common sense understanding of the world. We feel in our bones that it is right. How can it be 
wrong? 

But it is wrong. I am going to tell you today that it is wrong and what is right and you will 
not understand me or believe it. Even if you have heard this before, and you think that you have 
internalized it, you are still not going to really understand it or believe it. In fact, you will not even 
have any sense of what it would mean to understand it or believe it. It is that strange. It doesn’t 
make any sense. 

The first thing to say is that there is no reason why our sense should have anything at all to do 
with what happens at extreme conditions, far from what we are used to in everyday life. We have 
some direct intuition about things that are about our size - and maybe a few powers of ten bigger 
and smaller. We can feel in our bones what happens for accelerations not much different than g, 
and velocities like those we are used to. But if we go far outside this familiar range of parameters, 
it would be rather surprising if our common sense worked very well. We should be prepared for 
surprises. If anything, what should surprise us is that our common sense works as far as it does. 
We have to go to really enormous velocities, on our everyday scale, before Newtonian kinematics 
starts to break down. And atoms, which exhibit quantum behavior in all its glory, are very small. 
This is a theme that we will return to several times. 

The wonderful thing about the discipline of modern science is that we can say sensible things 
about phenomena even when our sense doesn’t work. We do this by keeping ourselves firmly 
grounded in what we understand, but at the same time recognizing the limitations of our knowl¬ 
edge. It is useful to think of science as a map of a peculiar space - the space of parameters that 
describe physical phenomena, things like size, mass, speeds, temperatures, etc. We have all grown 
up in a familiar, comfortable neighborhood described by a small region on this map. But we have 
expanded our knowledge of the terrain in much the same way that ancient explorers improved their 
maps of the known world. We work our way out from what we know into the unknown, exploring 
and pushing the boundary of what we know farther and farther in different directions away from 
the range of phenomena that we see in the everyday world. In the next few weeks, we are going to 
discuss one of these directions — the realm of the very fast. When it is strange, don’t be surprised. 
The reason that you don’t understand is very simple. It is that you are slow! Not mentally slow, 
but physically slow. You have spent all your life moving at speeds very very tiny compared to 
the speed of light, so nothing in your experience has prepared you for the phenomena that happen 
all the time at large speeds (which we call “relativistic” speeds — a rather bizarre grammatical 
construction if you think about it, but standard). Try to bear that in mind when it seems that what 
we are doing doesn’t make any sense. 

The speed of light 

The speed of light is exactly 299,792,458 m/s. What “exactly” means in this case is just what it 
says. Because the speed of light, as we will see, is is a kind of cosmic speed limit built into the 
structure of space and time, it makes sense to use it to define our unit of length (the meter) in terms 
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of our unit of time (the second). This is what is done in SI, the International System of Units. It 
is no longer necessary to keep a “standard” 1 meter bar in a vault someplace. The second is now 
defined in terms of a particular oscillation of an atom in an atomic clock. The meter is then defined 
as the distance that light travels in 1/299,792,458th of a second. I should say that when I talk about 
the speed of light, I always mean the speed of light “in vacuum” — that is in empty space. 1 Things 
get more complicated in material like glass because the interactions of the light with the material 
can slow the light down. 

Now 299,792,458 m/s is fast. It is a heck of a lot faster than we can actually move ourselves. 
But it is certainly not infinitely fast. With modern electronics, we can measure very short times, so 
it is not impossible to see the effect of the finite speed of light even over fairly short distances. The 
point I am trying to make here is that while motion at close to the speed of light is far beyond our 
everyday experience, it is not science fiction. In fact, we routinely measure the speed of light, and 
routinely see things (small things like electrons, but things nevertheless) moving at speeds very 
close to the speed of light. 

But the surprising thing about light in a vacuum is that the speed of light that we measure 
doesn’t depend on the velocity of the object that produced the light, and it doesn’t depend on the 
velocity of the measuring apparatus. Now if you think that you understand this, you obviously 
have not been listening carefully enough, because this doesn’t make any sense at all. Nevertheless, 
it is true. If, for example, I am running towards a light-bulb at speed v carrying a light-speed 
meter, a device to measure the speed of light, all of you sitting at rest see the light from the bulb 
approaching me at a speed v + c. But when I do the measurement, I get the same value for the 
speed of light that I would get if I were standing still. In fact, I get the same value that you would 
get measuring the same light beam in about the same place at about the same time, but standing 
still. The same thing happens if I am running away from the light source. 


c=299792458 


c=299792458 


c=299792458 
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This is absolutely crazy. Surely if I am moving towards the light beam, I should register a larger 
speed on my light-speed meter. That is what common sense would say. However, that is not the 
way the world works. The way the world works is that the speed of light in vacuum is constant, 
period! It is not that something goes wrong with my light-speed meter. This bizarre fact is built 
into the way the world works. 

The full power of this remarkable fact, the constancy of the speed of light, is unleashed when 
we combine it with another, much more reasonable fact about the way the world works — the 
principle of relativity. The principle of relativity says simply that all uniform motion is relative. 

'The notion of “empty space” is itself rather problematic. Even classically, space is only completely empty at 
absolute zero. And when we include the effects of quantum mechanics, as we will see much later, empty space begins 
to look anything but empty. Nevertheless, there is a well-defined meaning to the notion of the speed in light in vacuum. 
Its role as a cosmic speed limit survives all this extra complication. 
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There is no absolute sense in which I can say I am moving. There is no preferred notion of standing 
still. In a moment, we will formalize this idea with the notion of an inertial frame of reference. 
Note that we can tell if our motion is not uniform. Acceleration is accompanied by forces that we 
can feel in our bones. But uniform motion is not detectable, so long as everything else we need 
is moving along with us. This, of course, is something that feel in our bones for the slow motions 
that we are used to. We all know this very well from travel in vehicles, cars, trains, planes, and 
whatnot. We are going to assume, with Einstein, that it remains true at relativistic speeds. 

Inertial frames 

The idea of an inertial frame of reference or just “inertial frame” for short, is one that already 
plays an important role in non-relativistic mechanics. It is an attempt to formalize the notion that 
motion is relative in an operational way. To do this, we must carefully describe what velocity 
means by describing precisely what we need to measure it. 

On the surface, the speed of light does not seem to be a complicated concept. You measure 
it in the obvious way with clocks and meter sticks, by dividing the distance traveled by the time 
taken. But first, you have to synchronize your clocks! This is where the idea of an inertial frame 
comes in. An inertial frame is a real or imaginary collection of clocks that are fixed with respect 
to one another and synchronized, for example by requiring that some signal that originates midway 
between each pair of clocks arrives at the two clocks at the same time. 2 In addition, an inertial 
frame must not be accelerating, which is easy to check because you can just demand that Newton’s 
laws hold for small velocities — free particles travel in straight lines, that sort of thing. 

So we have two fundamental principles. 

A. That the laws of physics are the same in all inertial frames, and 

B. That one of the laws of physics is that the velocity of light is a constant — with the same value 

in all inertial frames, 

As you will see in more detail in the notes, these two principles are amazingly powerful. They will 
revolutionize our picture of space and time. Now let’s see some of the consequences of putting 
these two ideas about the world together. 

Time dilation 

Let’s start with one of the strangest and most trivial of the consequences of relativity — time 
dilation. The phenomena of time dilation can be stated precisely as follows. Observations done on 
a single clock moving with speed v with respect to a number of clocks fixed in an inertial frame 
show the ticking of the moving clock slowed down by a factor of ^ 1 — v 2 /c 2 . The standard way 

2 It doesn’t matter for this purpose whether we are using a light signal or the Pony Express, as long as the signal 
travels at the same speed in both directions! 
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of deriving this result is to consider an idealized clock made out of two parallel mirrors and a pulse 
of light bouncing back and forth between them: 


mirror 


light 

pulse 


mirror 

A light-clock at rest 


( 2 ) 


If the distance between the mirrors is L, the time for each tick of the clock, defined as the time for 
the pulse to get from one mirror to the other, is L/c. Now suppose that the two mirrors of this light 
clock are mounted on parallel tracks a distance L apart and the two mirrors moved down the tracks 
with velocity v. Now the system looks to observers in the fixed frame as shown below 


mirror 



mirror 

A light-clock in motion 


Obviously, from the point of view of the many clocks in the inertial frame, the light pulse has to 
go farther when the single light clock is moving. Thus if light always travels at the same velocity, 
the ticks of the light clock take longer when it is moving. Call the factor by which the ticks are 
longer 7 . Then we can compute 7 as follows. Each vertical transit of the light from one mirror 
to the other in the moving frame takes time 7 L/c (just from the definition of 7 - because L/c is 
the time for a tick in the rest frame). And because light travels at the same speed, c, the length 
of the path from one mirror to the next therefore must be 7 L. The light pulse moves vertically a 
distance L (because the tracks are a distance L apart) and horizontally a distance v ■'yL/c (just the 
velocity of the light clock multiplied by the time). Now we look at the geometry of the motion. 
Then Pythagoras tells us that 

(7 L) 2 = ^Lv/c) 2 + L 2 ( 4 ) 


which implies 



( 5 ) 
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So time is no longer sacred. And this can’t just be a special property of light clocks, because if we 
used some other kind of clock to measure the time, and got a different result, then we would be 
able to distinguish between the moving frame and the frame in which the light clock is fixed. But 
this violates the principle that all frames are equivalent. Every kind of clock must tick out seconds 
at the same rate in all inertial frames. 

Incidentally, this factor 7 is going to reappear all the time, so it pays to actually either memorize 
it, or to be able to reproduce the light-clock argument in real time so you can get it whenever you 
need it. 

It is quite easy with modem electronics and atomic clocks to see relativistic effects like time 
dilation. In fact, both special and general relativistic effects are very important in one very practical 
application — the Global Positioning System which is based on a system of atomic clocks aboard 
satelites. The relativistic corrections are small, because the satellites are traveling at “only” about 
4000 m/s, but enormous accuracy is required to make GPS work and the relativistic effects must 
be properly included. See for example http://www.physicscentral.com/writers/2000/will.html. 

Even more dramatic examples of time dilation occur all the time with elementary particles. 
That seems like a lot to ask of tiny particles that are supposed to be elementary and have no 
internal structure. But the fact is that quantum mechanics provides us with internal clocks for 
many elementary particles because they are unstable, and when they are sitting still and evolving 
in time, they have a constant probability per unit time of decaying into other lighter particles. We 
can actually see these internal clocks ticking (at least in an average sense) by watching the particles 
decay. The observed lifetime of unstable particles is a tangible measure of how fast these internal 
clocks are ticking. We see this all the time in particle experiments. But we are relying on another 
fact — all particles of a particular kind are exactly the same. We never actually measure the decay 
rate of the same decaying particle in two different frames. But we can quite easily measure the 
lifetime of a particle at rest, and then measure the lifetime of the same TYPE of particle in a 
moving frame. We find that the ratio of the lifetimes is 7 . Since all particles of a particular type 
are identical, this is just as good. 

Another important thing about time dilation is that although it is strange, it is probably the 
easiest of the relativity principles to remember and use. The thing to remember is 


The single clock measures the shorter time. 


( 6 ) 


If you keep this in mind, and just remember that 7 > 1, you will always be able to reconstruct the 
right formula. Use time dilation whenever you can to solve problems! 


The twin paradox 

Now you may very well be thinking, at this point, that once you define what you are talking about 
carefully, with inertial frames, that there isn’t anything particularly strange about motion at rela¬ 
tivistic speeds, but that we have just confused the issue with a bizarre definition of measurement. 
Even our experiment on decays of elementary particles might be just a matter of a bad definition 
of what we mean by the ticking of their internal clocks. Perhaps, you think, that there is some 
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other way of constructing our light-speed meters so that the speed of light is not constant and the 
bizarre features of relativity go away. Think again! Perhaps the simplest way of making clear that 
something totally bizarre is going on is to discuss the twin paradox. This is a classic thought exper¬ 
iment in which one twin takes a trip on a rocket moving at relativistic speeds, while the other twin 
remains at home. When the traveling twin returns, because his clocks have been ticking slowly, he 
is younger than his twin. His biological clock is no different from any other clock. Relativity has 
slowed down the aging process. If this is not strange, I don’t know what is. 

Again, this experiment has not been done conclusively with people. Astronauts in MIR or the 
space shuttle do age less rapidly than the rest of us, but the difference is sufficiently small at the 
speeds of mere orbital motion that the don’t see a huge difference in biological clock (which aren’t 
very accurate). The difference can be measured by atomic clocks. And a number of very accu¬ 
rate experiments have been done showing exactly this effect with the internal clocks of unstable 
elementary particles. 

The twin paradox is so peculiar that I want to work out an example of how it looks to the two 
twins who are aging differently. To do this, it is useful to first understand the relativistic Doppler 
effect. 


The Doppler effect 


While the speed of a light beam does not change when we go from one inertial frame to another, 
its frequency does change. This is not surprising, since the same thing happens for sound or any 
other wave. It is called the Doppler effect and shows up, for example, in the change in sound of 
a train whistle when the train goes by. First, let me remind you how the Doppler effect goes for 
sound, or any other wave when we are moving at nonrelativistic speeds. Suppose that a train is 
moving towards me at speed v and its whistle emits sound waves which, when the train is at rest, 
have frequency u 0 , and wavelength Ao- The speed of sound, V s , is the product of the frequency and 
the wavelength: 

V a (m/s) = A 0 (m/cycle) • v 0 (cycles/s) (7) 

The inverse of the frequency is the period of the sound wave, which is the time between successive 
crests of the wave. So now let us look at two successive crests of the wave as the train moves 
towards us — illustrated below: 


sound meter 
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Because the train is moving forward as it emits the wave, the crests are closer together than they 
would be if the train were standing still. The distance between crests for the train at rest is just the 
wavelength, 

V. 

A 0 =* - (9) 
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The distance between crests for the moving train is 


V 8 ~ v 
v 0 


( 10 ) 


Thus the wavelength of the sound as recorded at the sound meter is reduced by the nonrelativistic 
Doppler factor 


V s -v 

Vs 


( 11 ) 


Because (7) must be satisfied, the frequency is increased by the inverse of (11), and the train whistle 
has a higher pitch when it is moving towards us. 

If the train is moving away, the argument is exactly the same — we just have to replace v —> —v 
in ( 11 ). 

Now suppose we do a similar thing, but replace the train with a rocket moving at relativistic 
speed, and replace sound with light. The speed of light is the product of the frequency and the 
wavelength: 

c (m/s) = A 0 (m/cycle) • u 0 (cycles/s) (12) 


The inverse of the frequency is the period of the light wave, which is the time between successive 
crests of the wave. In the diagram, almost everything is the same, except that because of time 
dilation, the time between the emission of successive crests of the wave is longer than l/i / 0 by the 
ubiquitous factor of 7 , because the moving clock ticks more slowly. Thus the picture looks like 
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Because the rocket is moving forward as it emits the wave, the crests are closer together than they 
would be if the rocket were standing still. The distance between crests for the rocket at rest is just 
the wavelength, 

A 0 = - (14) 

Vo 

The distance between crests for the moving rocket is 


_ 7 (c - v) 

Ay 

V 0 


(15) 


Thus the wavelength of the light as recorded at the light meter is reduced by the relativistic Doppler 
factor 


7(0 — v) 
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Because (12) must be satisfied, the frequency is increased by the inverse of (16), 


1 + v/c 
\ 1 — v/c 


( 17 ) 


and the light has higher frequency when the rocket is moving towards us. This is called blue-shift 
because raising frequency in the optical spectrum is a shift towards the blue. 

Again, if the rocket is moving away, the argument is exactly the same — we just have to replace 
v —> —v everywhere. This is called red-shift because lowering frequency in the optical spectrum 
is a shift towards the red. 

There is one very important distinction to note about the relativistic Doppler effect versus the 
nonrelativistic version. In the relativistic version, it doesn’t matter whether the rocket is approach¬ 
ing the observer at speed v or the observer is approaching the rocket at speed v. It can’t, because 
of the principle of relativity. This is not true for the nonrelativistic Doppler effect because the air 
in which sound moves defines a special frame. 


The twin paradox and inertial frames 

We can now use the Doppler effect to understand time dilation and the twin paradox, by keeping 
track of every tick of the moving clock. So we have two twins, who are, as twins often are, the 
same age. But one twin (called the “rocket twin”) takes a trip to planet X, at distance L from earth, 
staying in constant communication with the “earth twin” behind on earth. The communication 
is done using radio waves, or some other electromagnetic waves that have fixed frequency in the 
rocket twin’s frame (the rocket frame). We will assume the typical form of the twin paradox that 
we discussed above, where the rocket twin goes out to planet X at speed v, quickly turns around, 
and returns at the same speed. 

Here are a series of snapshots showing the important times during the trip, the rocket twin 
leaves the earth twin at t — 0 traveling at constant v - staying in constant radio communication 


t = 0 



the rocket twin arrives at planet X at t = L/v, turns around (quickly) and sends a turn-around 
signal (shown as the open triangle < in the diagrams) to the earth twin, indicating that he has 
reached the planet. 


t = L/v 
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The turn-around signal reaches the earth twin at f = tx = L/v + L/c 


t — tx 


v R 


— 4 — 
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t = 2 L/v 


twin are 

reunited at t = 2 L/v 
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The radio transmitter is a clock - number of ticks (cycles) is tv where v is frequency of trans¬ 
mitter. Thus the number of ticks sent by the rocket twin is T v, where T is the total time he aged 
on the trip. 

But the earth twin receives red shifted photons for a time tx = L/v + L/c and blue shifted for 
2 L/v — tx = L/v — L/c. Therefore, the number of cycles received by the earth twin is 
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Thus 


T= — Jl-v 2 /c 2 


Nonrelativistically, we would have expected 


_ 2 L 

T = — 
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Thus the earth twin “sees” rocket clocks ticking more slowly by a factor of 

\Jl - v 2 /c 2 = 1/7 


(18) 


(19) 

( 20 ) 


( 21 ) 


( 22 ) 


This is time dilation. The moving clock ticks more slowly as seen by the clock at rest. This is also 
the twin paradox. Because the rocket twin has sent out fewer ticks, he has also aged less. He is 
younger than the earth twin when he returns. 

There are a couple of other things to notice about (22). First note that the two terms in ( 22) are 
equal. This had to be the case, because the rocket twin sent the same number of ticks on the way to 
planet X as on the way back, so the earth twin received the same number of ticks in the red-shifted 
signal from the trip out as in the blue-shifted signal from the trip back. 
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But now, you say, why isn’t the situation symmetrical? After all, from the rocket twin’s point 
of view, the earth twin (along with the rest of the earth) has moved away at speed v and then come 
back at the same speed. Why is it that the rocket twin is younger at the end, rather than the earth 
twin? Let’s look at the trip from the rocket twin’s point of view, assuming that it is the earth twin 
who is sending out radio signals the whole time. Now things look a bit different. Here is the 
chronology. At t — 0, the rocket twin watches the earth twin and the earth recede at constant 
v. The rocket twin receives radio signals from the earth twin that are red-shifted until planet X 
appears. Planet X is shown as dashed in the figure because it is not in the same inertial frame as 
the rocket twin’s ship, so the rocket twin has to be a whiz at relativity to calculate its position. 


t = 0 


R 


<— v E 


X 


Planet X reaches the rocket twin at t = T/ 2, turns around (quickly) and starts to recede again. 
From this point on, the rocket twin receives blue-shifted signals until earth reappears and he is 
reunited with the earth twin. 
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the rocket twin and the earth twin are reunited at t = T 


t = T 


R 


E 


X 


Now we can check that the two pictures are consistent. The number of cycles that the rocket 
twin receives is 
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which using ( 20 ) is 
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(25) 


which is the frequency times the time elapsed on earth, as we expected. I hope you can guess from 
this discussion what the asymmetry is. Of course I have not quite described this process the way 
the rocket twin experiences it. He doesn’t just watch earth receding at t — 0. He blasts off and 
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accelerates. Similarly, he doesn’t just watch planet X turn around. He decelerates and accelerates 
again in the opposite direction. He feels these accelerations in his bones! From the rocket twin’s 
point of view, the switch from red-shift to blue-shift occurs the moment he turns around. This 
makes sense. He knows that he has turned around. But from the earth twin’s point of view, nothing 
special happens when the rocket twin reaches the planet. He has to wait until the tum-around signal 
arrives to see the shift from red-shift to blue-shift. 

Here is another way of saying what the difference is. The earth twin, remaining on earth, is at 
rest in a single inertial frame the whole time. The rocket twin is not. He is in one frame on the way 
out, and in a different frame on the way back. It is the fact that the rocket twin must switch from 
one inertial frame to the other that makes his experience different. 

Relativity may be strange, but it is consistent. 
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lecture 11 


Topics: 

Where are we now? 

Space-time events 
The space-time interval 
The invariant interval 
Lorentz transformations 
Invariance as a way of thinking 
Varieties of space-time intervals 
Interlude on relativistic units 
The tip of tomorrow 

Where are we? 

We are lost in an odd 4-dimensional space! That’s where! I hope the last lecture and the readings in 
Morin have convinced you that relativity is strange. We have discussed time dilation and the twin 
paradox in the last lecture. You should have read about the relativity of simultaneity and Lorentz 
contraction in David’s book. By next week, I will be able to explain why I am so sure that relativity 
is true. For now, just try to accept that as strange as this is, there is absolutely convincing evidence 
that the world really works this way. Since we are stuck with it, we should figure out how to make 
sense of it, and how to avoid getting confused by the strangeness of it all. To do this, you must 
train yourself to consider space and time together, because what is time in one inertial frame is a 
combination of time and space in another. There are a number of closely related ways of dealing 
with this. I will talk today about a few things that you have already read about in David Morin’s 
book: space-time events, the space and time intervals and the invariant combination of the two, 
and Lorentz transformations, because I want to try to put them in some perspective. At the end of 
this lecture, I will try to put a lot of this together and discuss in more detail the question of why is 
it that you are not used to thinking about space-time as a unified, 4-dimensional thing. Of course 
it is because you are slow - but there is more to say that I think is rather interesting. 

Space-time events 

The concept of a space-time event is very simple. Even in non-relativistic physics, to describe an 
event completely you must specify not only where it is but also when it takes place. What’s the 
big deal? The importance of the space-time event in relativistic physics is really more of a negative 
one. There are many things that don’t make absolute sense in the relativistic world because they 
look different in different inertial frames. Time is relative. Distance is relative. Simultaneity is 
relative. The space-time event gives us something to hang on to. The coincidence of space-time 
events is NOT RELATIVE. If space-time events coincide, that is if they happen at the same place 
and at the same time, they coincide in all inertial frames. Space-time events are thus the sensible 
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things to talk about. Though the coordinates of a given space-time event will change depending on 
what inertial frame you’re in, the event itself does not change. 

The right analogy to keep in mind is that different inertial frames in space and time are like dif¬ 
ferent coordinate systems in three-dimensional space. Time and distance change in going from one 
inertial frame to another just as the x component of a point changes in going from one coordinate 
system to another. But space-time events in space and time are like points in three-dimensional 
space. If points coincide in one coordinate system, they coincide in all coordinate systems, just as 
space-time events are coincident in all inertial frames if the coincide in any one. 

One of the things that we will try to train you to do is to identify the important space-time 
events in a process, and to be very suspicious of any statement or question that cannot be described 
in terms of space-time events. Let me illustrate the difference in the twin paradox discussion we 
gave in the last lecture, where twin 1 left twin 2 on earth and traveled to planet X and back. Here 
the crucial space-time events were: 

Twin 1 leaves twin 2 — x — 0, t — 0 in Earth’s frame, with Earth at the origin; 


t = 0 


1 v —» 
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Twin 1 arrives at planet X — x = L, t = L/v 


in Earth’s frame; 
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The turn-around signal reaches twin 2 
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Twin 1 and twin 2 are reunited — 


x = 0, t = 2L/v in Earth’s frame. 


t = 2 L/v 
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These events have an invariant meaning even though their coordinates, both their space and their 
time coordinates, will change if they are viewed in a different coordinate system. 

It is perfectly meaningful to put together events, so long as they occur at the same place and 
at the same time. For example, again from the discussion of last lecture, “Twin 1 arrives at planet 
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X at t = L/v, turns around (quickly) and sends a turn-around signal to twin 2, indicating that 
he has reached the planet.” This describes two separate events — reaching the planet and sending 
the signal — but because they happen at the same place and at the same time, their space and 
time coordinates will be the same in all reference frames. Thus the statement makes sense in all 
reference frames 

If we don’t stick to space-time events, there are lots of questions that we can ask that don’t 
mean anything at all unless we specify the reference frame. For example 1 

Meaningless questions 

How far does twin 1 travel? 

How long does it take twin 1 to get to Planet X? 

Where is twin 1 when the turnaround signal reaches Earth? 

How old is twin 2 when twin 1 reaches planet X? 

The last two are a classic kind of pitfall, because they attempt to compare two events at different 
places. Think about them carefully. 

The space-time interval 

Let me illustrate some of these issues by talking about a particularly useful sort of space-time event 
— the tick of a clock. If I have a clock sitting at the origin of the reference frame of the lecture hall, 
ticking every T seconds, the ticks of the clock define a series of space-time events. In the frame 
of reference of the lecture hall, all of these events have space coordinates 0, because they are all at 
the origin, and their time coordinates are just 0, T, 2T, etc. Thus the time and space coordinates of 
the series of ticks would look like 

( 1 ) 

In general, if we hadn’t put the first tick at the origin, the r and t coordinates of the later ticks 
would all just have the initial r and t positions added on to them, so the sequence would look like 

( 2 ) 

We know that there is nothing very interesting about the r 0 and the t 0 , which can be changed by 
moving to a different coordinate system by translations in space and time. Thus it is useful to 

'The word “meaningless” in the table is a little strong. It might be fairer to say “questions whose answers depend 
on the reference frame,” but we are trying to get your attention! 
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get rid of this dependence. We can do this in a way that is completely independent of our choice 
of origin of coordinates by considering the space-time intervals, which are just the differences 
between the coordinates of space-time events 


AE 0 i — E l — E 0 — (Af 01 , Ary 01 ) — (fi — t 0 , E ~ E o) (3) 

(4) 

Events are the important underlying things, but have all this extraneous dependence on the origin 
of the coordinate system in space and time, so it is almost always useful to think about space-time 
intervals when you want to calculate something. 

So far there is nothing relativistic about this, but here it comes. We also know from our discus¬ 
sion of time dilation what these same events look like in a reference frame in which the clock is 
moving. Suppose that the clock is moving with speed v in the +x direction. Now because of time 
dilation, the ticks are spread out by a factor of 7 , so that (if we set our clocks so that the first tick 
is at zero) the time coordinates are 0, 7 T, 27 T, etc. Let us also assume that the space coordinates 
are 0 for the first tick (at t = 0). Then because we know the velocity of the clock, we can easily 
work out the other r coordinates - 

(5) 

Notice where the asymmetry of time dilation comes from here. We get these factors of 7 for 
the clock readings in our frame because we are using a series of our clocks at different places in 
our frame to observe the ticks of the single moving clock. This is always the way time dilation 
works. The single clock measures the shorter time! We in our frame see the 

Again, if we hadn’t put the first tick at the origin in space and time, the x and t coordinates of 
the later ticks would all just have the initial f and t positions added on to them, so the sequence 
would look like 

( 6 ) 





And again, we can get rid of the dependence on the origins in space and time by looking at the 
intervals: _ 


intervals 

AE’oi 

A£q 2 


(At', A f') 

( 7 T, xv^T) 

( 27 T, 2xv^T) 
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Finally, there is nothing special about the x direction. For a different moving clock (or a different 
choice of coordinate system), the motion might be rotated into a different direction, x —■» v 


intervals 

AE 0 i 

AE 02 


{At", A r ") 

( 7 T, vv'yT) 

( 27 T, 2vv r yT) 



While you are used to thinking of the difference between (1) and (2) or between (7) and ( 8 ) as a 
change of coordinate system, you are probably not used to thinking about the difference between 
(1) and ( 6 ) as a change of coordinate system. But what you see here is that these space-time 
coordinates really can describe the same set of events, the ticks of a clock, in different reference 
frames. Going from one reference frame to another IS like going from one coordinate system 
to another — it is just that the time “coordinate” as well as the space coordinates are involved. 
The events don’t change, but both the time and space coordinates change when we go from one 
inertial frame to another. What this means is that we have gone from a 3-dimensional space to a 
4-dimensional space-time. 

The invariant interval 

We can extend the analogy between going from one coordinate system to another and going from 
one reference frame to another in the following way. When you make a change of coordinate 
system in space, the distance between points doesn’t change. You can easily compute the square 
of the distance between two points labeled by vectors fj and fj in terms of their coordinates, 

f = (fj - f 2 ) • (fj - f 2 ) = ( r i - r () 2 (9) 

j=x,y,z 

Two things have happened here. First we have subtracted the coordinates of the two points. This 
eliminates dependence on the origin of the coordinate system, which just cancels when we subtract. 
Another way of putting this is that we are not interested in the length of the vectors themselves, 
because this depends on the arbitrary position of the origin. But if we make the vector from fj to 
fj by forming the combination rj — fj, it doesn’t depend on the origin. Then we sum the squares 
of the coordinates to get something that doesn’t change when we make a rotation. 

When you make a change of coordinate system in space-time, by going to a new reference 
frame, there is a similar thing that doesn’t change. If I have two events, event 1 at time t\ and 
position fj and event 2 at time t 2 and position fj, the following combination doesn’t change when 
we go from one reference frame to another: 

s 2 = c 2 (ti - f 2 ) 2 - (fj - fj) • (fj - fj) (10) 

doesn’t change when we go from one reference frame to another. Again we have done two things. 
The obvious one is to subtract the space and time coordinates of the two events, so that s 2 doesn’t 
depend on the origin of coordinates. The other, much less obvious one, is to combine the difference 
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in time and the difference in space in a very particular way. You can see that this is the right thing 
to do by looking at two successive clock ticks in any of (4), (7) or ( 8 ), we have 

s 2 = c 2 T 2 - 0 2 = c 2 T' 2 (11) 

In ( 6 ), we have 

s 2 = c 2 ( 7 t) 2 - (n 7 T) 2 = (c 2 - n 2 )T 2 /(l - v 2 /c 2 ) = c 2 T 2 (12) 

This is independent of v, so it looks the same in any frame of reference. 

The quantity s 2 in (10) looks a lot like the quantity i 2 in (9). There are three differences. One 
is that time is involved. The second is the minus sign in front of the second term. The third is the 
factor of c 2 in the first term. 

Let’s deal with the easy one first. The factor of c 2 is nature’s way of telling us that we are using 
a really stupid system of units. Because time and space get mixed up in relativity, and because the 
ratio v/c appears in many many of the relations of relativity, it makes sense to use units in which 
c = 1. One way of doing this is to use seconds as your unit of time and light-seconds, that is the 
distance light travels in a second, 299,792,458 meters. Of course, because we are so slow, this is 
an inconveniently long distance (which is why we don’t adopt this convention as a standard part of 
our metric system) but you will get used to it. Anyway, 1 is easier to remember than 299,792,458. 
So from now on, we will follow nature’s advice and set c = 1, which we will call “relativistic 
units.” This will make our formulas look simpler. We also lose something by doing this, but it is 
not very important, and we will come back and discuss it at the end of the lecture today. 

The other two differences between s 2 and ( 2 express the crucial strangeness of relativity. Time 
is not quite the same as space — that is obvious from the minus sign. But it is not completely 
different either, because it must be included to get something that looks the same in different 
reference frames. 

Notice that because of the minus sign, s 2 can have either sign. It is positive in our example 
of the interval between two ticks of a clock, but for other kinds of intervals, such as the interval 
between two events that are in different places but at the same time in some frame, s 2 can be 
negative as well. The quantity s 2 goes by various names. I will call it the “invariant interval.” 


Lorentz transformations 


You have read about a nice derivation of Lorentz transformation in David Morin’s text. I will write 
them down and talk about them. They look much simpler with c = 1. So suppose that we have 
two events 


event 1 at t\, f) and 
event 2 at f 2 , r 2 . 


(13) 


The interesting thing is the interval in time and space between these two events: 


At = t 2 - 1 1 , A r = r 2 - fq . 


(14) 
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Now if we look at these same two events from a reference frame moving in the -\-x direction 
with speed v, the coordinates of the events will change. We will get a new description of the events 
in terms of new coordinates, 


event 1 at t\, f{ and 
event 2 at t' 2 , r 2 . 


(15) 


Again, the interesting thing is the interval in time and space between these two events: 


A t' = t' 2 - t\ 


A f' = f 2 — r{ 


(16) 


Now the statement of the Lorentz transformation is that we find that we can choose a coordinate 
system in which 

Ax' = ^(Ax — vAt), 

At' = 7 (A t — vAx ), (17) 

Ay' = Ay , Az' = A z . 

If we set Ar = 0 and look at Ax' and At' as a function of At, we can recognize the moving 
clock. The condition Ar = 0 means that we are looking at a series of events that are fixed in the 
original frame. A pair of events with Ar = 0 and time difference At could be two ticks of a fixed 
clock a time At apart. 

A f— 0 


Ax' = j(—vAt), 
At' = j(At), 

Ay' = 0, Az' = 0. 


(18) 


The 7 in 


expresses time dilation. The —v in 


At' = ^(At) 


Ax' = j( —v At) 


describes the clock moving with x velocity —v in the new frame. 

The other terms in the Lorentz transformation can be understood in a similar way. 

Notice that the Lorentz transformations are linear transformations. This is important for many 
reasons, but one of the most obvious is that the time evolution of a clock in some reference frame 
can be described by a straight line in space and time. The transformation from one frame to another 
must take any such straight path into another straight path. 

It very important to understand why Lorentz transformations exist. They are a kind of space- 
time analog of rotations, but they look different because of the minus sign in ( 10 ). I recommend that 
you leam about Lorentz transformations, that you learn how to use them, but that you actually use 
them in a problem only if forced to do so (that is if the problem says “Use Lorentz transformations 
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to There are usually easier ways of understanding any given problem. Lorentz transformation 
will get you there in the end if you don’t make any mistakes, but you have to be very careful and 
systematic to use them properly. 

Invariance as a way of thinking 

Throughout this course so far, I have been subtly (and sometimes not so subtly) trying to get 
you to use symmetry arguments and invariance as a way of thinking about problems. In most 
of what we do, this is just a convenience - it makes hard problems easier. But in relativity, it is 
absolutely essential. The point of the principle of relativity after all is that inertial frames give 
equivalent descriptions of the physics. It is pretty obvious that you should use this fact whenever 
you can to understand what is going on. That is why so many of the problems you will do about 
special relativity are much easier when you choose some particularly convenient inertial frame 
and/or compute some appropriate invariant quantity. I want you to understand special relativity 
as much more than a confusing collection of formulas. Even though you will never learn to feel 
special relativity in your bones, you can leam to appreciate it as an interconnected web of relations 
between different ways of looking at the same phenomena. 

While Lorentz transformations are often not the best way to solve problems, they will never¬ 
theless be very important to our understanding of relativity. Lorentz transformations are part of 
the basic symmetry of space and time, along with rotations in space and translations in time and 
space, and the invariant interval is invariant under all these transformations. The symmetry trans¬ 
formations and the invariant quantities that they leave unchanged are two sides of the same coin. 
We will very often use the fact that the invariant interval looks the same in all inertial frames. And 
we will find other invariant quantities as well. So while we will not always be using Lorentz trans¬ 
formations explicitly, they will be with us implicitly because we will often be using the quantities 
that they leave invariant, like the invariant interval. 


Varieties of space-time intervals 


We spent a lot of time talking about the space-time interval (A t, Ar) between two ticks of a 
moving clock. This is called a “time-like” interval, because there is a frame in which it has a time 
component but no space component — the frame in which the clock is at rest. In fact, any space 
and time interval with components 

(At,Ar) (19) 


that satisfies 


At > 



( 20 ) 


has the property that there is a frame in which it looks like (At, 0) with At > 0. Why? One way 
to see this is to note that any interval that satisfies (20) could be the interval associated with a clock 
moving with velocity 



( 21 ) 
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simply because if the clock moves with v for a time At, it ends up translated by the vector A f, 
reproducing (19). Thus if we go to a frame moving with velocity v, we will be moving along with 
the clock and in this new frame the intervals will have the form 


(Ar, 0) 


where 


Ar = 


/ / \ 2 


v( At ) - 

Ar 


( 22 ) 


This relation incorporates the statement of time dilation. Ar is always less than or equal to At. 


Two events that are separated by a time-like interval are closest together 
in time in the frame in which they occur at the same point in space. 


(23) 


The frame in which the two events occur at the same point in space is the frame in which the two 
events could be two ticks of the same clock. Thus (23) is equivalent to the statement that a moving 
clock ticks slowly. Two events that are separated by a time-like interval are sometimes referred to 
as “time-like separated.” 

On the other hand, if 


At < 



(24) 


then it cannot represent the interval between two points on the path of a clock. As we will see in 
more detail later, we cannot go to a frame moving with velocity v = Ar/At, because \v \ > 1, so 
the frame would have to moving at a speed greater than the speed of light, which is impossible for 
clocks and meter sticks and all the other things we need to have a frame of reference. Then what 
is this interval? Suppose that we look in a frame of reference moving with velocity 


u = 


Ar 



(25) 


To see what happens, let us rotate our coordinate system until Ar is in the x direction 


A f — 


Ar 


x 


(26) 


so that 


u = ux 


where 



(27) 


Now in a frame moving with velocity u, the y and z components of the interval remain zero, and 
we can compute what the time and x components become using the Lorentz transformation, 


At' = 


Ar' 


" vT 

= 7t 

= v / r 


vr^( At 

, 1 _ (1a f 

v 1 


u 


u 


u 


— U- 


- tr 
Ar 

Ar 

Ar 


1 — u 


Ar 

u At 
At 


= 0 


Ar 

M_ 

= \/l Ar T - ( At ) 


2 


( 28 ) 
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This is called a “space-like” interval, because there is a frame of reference in which it has a 
space component but no time component. A very important example of a space-like interval is 
the interval between two ends of a measuring stick at fixed time. This is what we define to be a 
measurement of distance. Two objects are a distance one meter apart in a given reference frame at 
time t if we can line up a meter stick so that the objects are at opposite ends of the stick at time t. 
This distance is associated with a space-time interval with is the difference between two “events” 
which are the space-time coordinates of the two ends of the meter stick at time t. This is obviously 
a space-like interval, because the time component is zero. This corresponds to the primed frame in 
(28). The properties of space-like intervals are then responsible for the phenomenon of “Lorentz 
contraction”. In any other frame, the two events occur at different times. Therefore, (28) implies 
that A r > A r' . The two events are closest together in the frame in which they occur at the 
same time. This is generally true for any space-like interval. 

Two events that are separated by a space-like interval are closest together ^ 
in space in the frame in which they occur at the same time. 

Thus for example, if we measure the length of a moving train, the length we measure is the distance 
between two events describing the positions of the front and back of the train at the same time in 
our frame. But in the train frame, these two events do not occur at the same time, and thus the 
distance between them is greater than what we measure (by a factor of 1/\/l — v 2 ). This is Lorentz 
contraction. 

Two events that are separated by a space-like interval are sometimes referred to as “space-like 
separated.” 


Interlude on relativistic units 


Let’s talk about the addition of velocity formula you saw in Morin 

Vi + v 2 
1 - Vi v 2 /c 2 ' 

In sensible units in which c = 1, this looks l ik e 

Vi + v 2 
1 - ViV 2 


(30) 

(31) 


One can ask, suppose, in sensible units, you get a result like (31). How would you know how to 
put the factors of c back into (31) to get (30)? This is a crucial step because it allows you to do 
calculations with c = 1 , to make things simple, but still get the full answer at the end of the day, 
so you can talk to people who insist on using dumb units. 

If you have done a calculation in relativistic units and you want to put the factors of c back in 
to translate the result to conventional units, you have to know the conventional units of the objects 
you have calculated. Then you put in factors of c to get the units right. 

In the case of (31), we know that we are calculating a velocity, and the numerator is already a 
velocity. So we can leave that alone and make the denominator dimensionless. The 1 term is OK, 
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and we can make the uv term dimensionless by dividing by two factors of c — and that is where 
(30) comes from. 

Another example: Suppose that you are calculating an energy in terms of a mass m and a 
velocity v and you get the result 

6 -7T mv 4 (32) 

The 67 t is a dimensionless number — we can forget about that. But the rest would not have units 
of energy in conventional units (remember, for example, that Newtonian kinetic energy is \mv 2 ). 
We have two too many factors of v, so we must divide by c 2 — thus the result in normal units is 


6n mv 4 /c 2 


(33) 


Here’s another one. For a system of two particles, with masses mi and m 2 and speeds V\ and 
v 2 , suppose you calculate an object that is a mass, and the result is 

mi/vi + m 2 v 2 
mi/m 2 + viv 2 

Let us start with the denominator. We don’t know what the units should be of the denominator 
alone (because we can always multiply numerator and denominator by a power of c), but it is clear 
that the two terms in the denominator must have the same dimension. So for example, we can 
divide the second term by c 2 . That makes the whole denominator dimensionless, so the numerator 
must then have the dimensions of mass. We can arrange this by multiplying the first term by c and 
dividing the second by c — thus the result in normal units is 


mic/vi + m 2 v 2 /c 
m 1 /m 2 + vi v 2 /c 2 


(35) 


The tip of tomorrow 

So then why is it that you are not used to thinking about space-time as a 4-dimensional object? I 
think that once again it is because you are slow, of course. But let’s look at this in a little more 
detail in the light of space-time events and the varieties of space-time intervals. You are used 
to thinking of time and space very differently. Space is spread out around you to infinity in all 
directions. Time is not spread out at all. Time passes, and you live in the present on your way from 
the past to the future. Here is a cartoon of our subjective picture of space and time, plotted just in 
the x — t plane. 
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But much of the difference between space and time is an illusion. Time and space are different, 
to be sure, because of that crucial minus sign in the invariant interval. The passing of time is real. 
But the spreading out of space is a fiction. It is a drastic oversimplification that has been built into 
our brains and our sensory apparatus over thousands of generations because we are slow. What 
we don’t think about or notice is that when we look out at the space spread out around us, we are 
actually looking back into the past. And we cannot access the future around us instantaneously. 
The real story looks more like this. 

future future future 

t T T T r T 

space? ~ — = 

T T T T T T 

past past past 


future future future future 

T T T T T T 

^ space? 


i 

\ 


T 


T T T t 

past past 


T 

past 


T 


T 

past 


T 


The slight slope of the long almost horizontal arrows is the non-zero value of 1 /c - the effect of 
the finite speed of light. If we were faster, we would recognize intuitively that the “space” that is 
spread out on all sides of us between these arrows at this moment in time is just as inaccessible to 
us as the future. This is the region of space-time events from which we are separated by space-like 
intervals. We can’t get to these events, nor can they get to us. This region is neither the past nor the 
future. There are three different regions of space-time - the past - the future - and the space-like 
separated that is neither past nor future. If we were fast, then the picture would look more like this. 


future 


future 


T 



T 


past 


past 


The boundary between the space-like separated region and the time-like separated regions of our 
past and future is a cone called our light cone. Our light cone consists of all the events in space- 
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time that we could communicate with or receive communication from by means of a single light 
signal, moving at the speed of light. In the figure, it looks like two crossed lines, but we can rotate 
in space. If we rotate it just in the x — y plane and keep z — 0 (for example), we get an ordinary 
three dimensional cone in the three dimensions, x, y and t. The light-cone of a given event, and 
with it the division of space-time into three regions - past, future, and space-like separated - looks 
the same in every reference frame because the speed of light does not change. The light cone is a 
funnel from our past to our future. In natural units with c = 1, it looks like this: 



Of course, if we make an arbitrary space rotation, we get a four dimensional cone, which is harder 
to visualize, so I recommend just ignoring z for now. 

The light-cone separates space-time into three regions, the time-like past which could have 
affected us, the time-like future that we can affect, and space-like unknown on all sides that can 
neither have affected us nor be affected by us. This picture doesn’t matter much to us in our 
everyday life, because we are so slow. But it will be crucial when we try to understand the large- 
scale structure of the universe later in the course. 
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| future future * 



You can see in these pictures that our present is the singular tip where the past light cone meets 
the future light cone. This is what I call “the tip of tomorrow” because it is the single point in 
space-time that separates our past from our future. 

As our present evolve into our future, our light-cone moves with us, further narrowing the 
future, and opening us to effects from events that were space-like separated. 
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For now, I hope that this picture helps you to understand why relativity seem so strange. And 
maybe this will help you to avoid the wrong thinking than can so easily get you derailed thinking 
about relativistic trains. Seen properly, through the lens of relativity, the “the present” is not in any 
sense an infinite three dimensional space. The present is both right NOW and RIGHT HERE! 
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lecture 12 


Topics: 

World lines and proper time 

Massive particles 

Energy and momentum 

Example of inelastic process 

Lorentz transformation of energy-momentum 

Energy, momentum, velocity and mass 

Massless particles 

4-vectors and the invariant product 

World lines and proper time 

There is a nice relativistic analog of the concept of a trajectory in Newtonian physics - based on 
the concept of a world line. The idea is that as any massive particle evolves with time, whether 
it is sitting still or moving, free or accelerating, it can always be described by some curve in 4- 
dimensional space-time that is just the collection of the space-time events (t, rit j 'j that describe 
where the particle is at every time. The reason that it is useful to think about this in this funny 
4-dimensional space (even though it is kind of hard to visuallize) is that the world line itself has 
an invariant meaning because it is a collection of space time events that have invariant meaning 
even though their coordinates will change depending on the inertial frame. The situation is similar 
to that of a curve in three dimensional space. The curve consists of points that have an invariant 
meaning, but their coordinates change depending on the coordinate system. Also, just as we can 
label the points on a curve in three dimensional space by the distance along the curve from some 
arbitrary point, so we can measure the distance along world line of a particle by measuring the 
time ticked on the particles internal clock. This is called the proper time, r. The change in proper 
time dr along any short segment of the world line is just 

dr = dt/'f = dty/l — v 2 = V dt 2 — dr 2 (1) 

where t and v are measured in whatever coordinate system and inertial frame you like. You can see 
that this is independent of the inertial frame and the coordinate system by looking at the last term, 
which is simply the invariant interval for the short line segment. It is crucial here that every short 
line segment along the world line of a massive particle is time-l ik e. This is true because massive 
particles can never travel as fast as light. We will return to this again and again and understand it 
various ways. 

A good way of describing the world line is to give both t and r as functions of r: 

t(r) r(r) (2) 

For example, for a particle at rest 

f(r) = r r(r) = 0 (3) 
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and for a particle with velocity v 


t{r) = yr r(r) = jvt (4) 

For travel in one dimension, we can plot world lines on a plane. For example, consider the twin 
paradox. In their own frame of reference, the world lines of earth (and twin 2 on earth) and planet 
X are vertical lines. The world line of twin 1 in the rocket has a kink where the rocket turns around 
- in the curious geometry of spacetime - the kinked line takes the shorter time. 



Massive particles 

I now want to change gears from talking about space and time to talking about energy and momen¬ 
tum. I am going to write down the Lagrangian for a free relativistic massive particle. Because this 
is a free particle, the Euler-Lagrange equation is not all that interesting. We know without thinking 
about it that it is going to tell us that the particle moves at constant velocity. Nevertheless, finding 
the right form for the Lagrangian can be instructive. It will allow to ask questions about what 
would happen if we put in forces, for example. Also, it allows us to identify the quantities that we 
expect to be conserved because of Noether’s theorem and translation invariance in time and space. 
In fact, once we write down these quantities — better known as the energy and the momentum — 
we will largely forget about the Lagrangian formulation. So don’t panic if this Lagrangian looks 
so strange that you can’t deal with it. What will really matter is that you learn to deal with the 
relativistic energy and momentum that come out of it from Noether’s theorem. 

What principles should we use to write down the Lagrangian for the free massive particle 
moving at relativistic speeds? Surely one important principle is that the Action should look the 
same in all reference frames. This is reasonable, because if it were not true, Hamilton’s principle 
would not necessarily give us equations of motion that give the same trajectories in all reference 
frames. In addition, if we call the position of the particle r, we expect that the Lagrangian is 
independent of f and of t, and depends only on r, because it should be invariant under translations 
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in space and time. In a moment, I will show that the following Lagrangian gives rise to an action 
that is independent of the reference frame: 


L(r, f ) 


-m 


■_2 

r 


(5) 


We are using units with c = 1 as usual. Here are a few things to note. This looks a bit funny 
because of the square-root. But as we will see, that is what relativity requires us to write down, so 
we will just have to live with it. The constant m, which we will see in a moment is the mass of the 
particle, must be there so that the Lagrangian has units of energy. 

Now let us show what happens to the action 


S = — m 


J 


dt V1 — f 


( 6 ) 


under a Lorentz transformation. This is a little complicated because both the integrand and the 
dt change under a Lorentz transformation. But we can make what is going on more obvious by 
writing (6) as follows: 

S = —m J dt^jl — ( df/dt) 2 = — m J \J dt 2 — (dr) 2 (7) 

The second form is a very funny looking integral, but it makes sense because it is equivalent to 
the previous form. The important point is that the last form is evidently unchanged by Lorentz 
transformations. The infinitesimal interval 


(dt, dr) 


( 8 ) 


is just a space-time interval. It transforms under Lorentz transformations just like (At, A r). And 
therefore, the combination 

dt 2 — (dr) 2 (9) 

is just an infinitesimal version of the invariant interval, and it has the same value in all inertial 
frames. To get something proportional to dt (so that we can put it under an integral sign and get a 
finite result), we must take the square-root of (9). That is why (6) looks the way it does, nutty as 
that is. 


Energy and momentum 


Now that we have a Lagrangian, we can construct the conserved energy and momentum that we 
expect for a relativistic particle. The energy is 


P u dL T 
h = r ■ —^ — L = m 

dr 


■2 

1 — r 


+ mV1 — r = 


m 


m 


1 — f 


Vi 


= my. 


( 10 ) 


The momentum is 


_ d L 
p = —- = m 
dr 


r 


1 - 


■_2 

r 


mv 

a/1 — V 2 


mv'y. 


(ID 
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These things look really funny, but these are the energy and momentum that we expect to be 
conserved because of Noether’s theorem. Of course, for a single free particle, this is pretty trivial. 
Nothing changes, so everything is conserved. But in fact, these objects (10) and (11) are conserved 
in exactly the way non-relativistic energy and momentum are conserved at low speeds. If we have 
a system with several particles and we add up all the energies or all the momenta, we get something 
that doesn’t change as the system evolves in time. The energy and momentum may flow around 
from one particle to another, but the total does not change. 

Here is an example. Suppose we have a particle with mass m traveling at v — |. Then 


1 _ 5 _ 5 _ 5 _ 5 
7 ~~ ^1 - (4/5)2 v/25^16 “ 79 “ 3 


( 12 ) 


so 



p = v'ym 



(13) 


You will observe that these do not look like the expressions for energy and momentum of a 
Newtonian particle. Nevertheless, these are the objects that are really conserved. It is instructive 
to look at them with the factors of c put back in: 


E 


me 2 

\Jl — v 2 /c 2 


(14) 


T11V 


P = 


(15) 


sjl - v 2 /c 2 

In this form, it is also useful to Taylor expand in powers of v/c in order to see what they look like 
at small velocity: 


E = 


me 


yjl - v 2 /c 2 


= me 2 + -mu 2 + 


(16) 


P = 


rnv 

yjl — v 2 /c 2 


mv + ■ ■ ■ 


(17) 


As we expected, the Newtonian expressions for the kinetic energy and the momentum appear. But 
in the expression for energy, (16), there is also a constant term, the famous me 2 . The me 2 in (16) 
is a constant that is irrelevant to the Euler-Lagrange equations, so at small velocities, the velocity 
dependent part of (16) reduces to the non-relativistic Lagrangian, as we expect. What is actually 
important about the extra term is that mass plays a very different role in relativistic processes 
than it does at lower speeds. In Newtonian mechanics, mass, kinetic energy, and momentum are 
separately conserved. But in relativistic physics, it is only the momentum p and the total energy 
E that are actually conserved in arbitrary collisions of elementary particles at any speed — even 
in processes in which particles are created or annihilated. Mass is not conserved. The mass of 
any one kind of particle is always the same, so mass is conserved in collisions that don’t change 
the type of particle involved. For example if you accelerate an electron to speed v and it collides 
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with an electron at rest, some of the time you will get a collision in which you end up with two 
electrons going off in different directions and no other particles. We might represent this process 
by the schematic “equation” 

e~ + e~ —> e~ + e~ (18) 

In this case the sum of the masses of the particles before the collision is the same as the sum of the 
masses of the particles after the collision. This is the analog in the relativistic world of an “elastic” 
collision in nonrelativistic physics, and I will sometimes use the same word to describe it. But 
other things can happen that don’t conserve mass. For example, in the process of the collision, you 
may produce an extra electron (so there are three electrons in the final state) and a positron (an 
anti-electron - with the same mass but opposite charge). 

e~ + e~ —» e~ + e~ + e~ + e + (19) 

Here the sum of the masses before the collision is 2 m e , and the sum after the collision is 4m e . 
Mass is not conserved. In fact, there is no reason to compute the sum of the masses at all. It is 
just not an interesting quantity. In fact, the words “elastic” and “inelastic” mean something a little 
different when applied to relativistic collisions than the do they do in the nonrelativistic case. For 
a nonrelativistic elastic collision, the total kinetic energy, J2 \mv 2 , is conserved. For a relativistic 
elastic collision, the total relativistic energy, mj, is conserved. In both cases, the particles in the 
initial state are the same as the particles in the initial state. For a nonrelativistic inelastic collision, 
kinetic energy is not conserved. Some of the kinetic energy is converted to heat and effectively 
lost. At the microscopic level, this is really a very complicated process in which kinetic energy 
of the inital objects is converted to kinetic energy associated with random motion of their parts. 
In relativistic physics, we use the term “inelastic” to mean something quite different — that new 
particles are created or destroyed. In this case, while energy is conserved, mass is not. This is 
summarized in the following table. 

small v large v 


elastic 


inelastic 


KE = J2 \rnv 2 is conserved 
— mass is conserved — 

E = J2 m l is conserved 
particles are conserved 
therefore 

— mass is conserved — 

KE = J2 \mv 2 not conserved 
— mass is conserved — 

E = J2 my is conserved but 
new particles are 
produced or destroyed 
— mass is not conserved — 


So once again, this shows that conservation of mass is just not something fundamental. Sometimes 
mass is conserved. Sometimes it isn’t. 
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Example of inelastic process 

Here is an interesting and important particle decay process 

J/ip —■> e~ + e + (20) 


where e~ is an electron, e + is a positron, the antiparticle of the electron, and J/ip is a particle that 
has two names for historical reasons. It was discovered allegedly independently at Brookhaven 
and at SLAC. Anyway, it has a mass of rrij « 3097 MeV. You can make it in particle collisions 
and it very quickly decays in one of various ways. Let’s summarize what energy and momentum 
conservation implies for J/ip at rest decaying into an electron and a positron. 


particle 

J/iP 

e + 

e 

V 

0 

v+ 

v 1 

E 

Tfl'J/'ip 

m e 7+= mj/^/2 

m e 7 _= mj/^/2 

P 

0 

m e l+v+ 

m e 7_n_ 


( 21 ) 


energy 

conservation 


mj/^ = m e 7 + + m e 7_ 


momentum 

conservation 


0 = m e 7+7+ + m e 7 _n_ 


7_77_ = — 7 + n + =>■ V- = — v + 


( 22 ) 

(23) 


Lorentz transformation of energy-momentum 

One of the important things about energy and momentum is that they behave under Lorentz trans¬ 
formations very much like a space and time interval. Remember that this has to do with what 
happens to their values when we go from one reference frame to another. The easiest way to un¬ 
derstand what happens to energy-momentum is to imagine that the particle has a clock on it and 
consider the space-time interval between two ticks of the particle’s clock, as we did in the last lec¬ 
ture. Consider, then, the space-time interval between two ticks of a particle’s clock. A space-time 
interval between two events has a time component, At, that is the time that elapses between the 
two events, and a space component, A r, that is the vector from the position of one event to the po¬ 
sition of the other. In relativistic units, of course, these two components have the same dimension. 
If the particle is sitting still, this interval has a time component, call it At, but its space component 
vanishes. 

Now suppose that the particle is moving with velocity v. Then, because of time dilation, the 
time interval between the same two ticks of the particle’s clock is 

At = At 1 (24) 

V1 — v l 
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But then, because 

< \< 

(25) 

we find 




A r = At . 

y/l — v 2 

(26) 

But then we can write 

777 

(E,p) = —(At,Ar) 

(27) 


But both m and r are invariants — constants — they are properties of the particles involved, not 
of the frame. Thus we conclude that ( E,p) must transform just like (At, A f) because the two are 
just proportional to one another. 


4-vectors and the invariant product 

I hope that by this time you are getting used to the idea of changes from one inertial frame to 
another, and the accompanying Lorentz transformation, as a kind of 4-dimensional analog of what 
we do in three dimensional space when we go from one coordinate system to another. After devel¬ 
oping the analogy between 3-dimensional space and 4-dimensional spacetime, how can we not go 
on to develop the analogy between 3-dimensional vectors and 4-dimensional vectors. Indeed, our 
treatment of the invariant interval was the first step in developing this analogy. We saw there that 
the invariant interval (this time with c = 1 ), 

s 2 = (ti - f 2 ) 2 - (n - r 2 ) • (fj - f 2 ) (28) 

is a kind of length. Like the length of a 3-dimensional vector, it has the same value in all coordinate 
systems, but here of course, the idea of coordinate system is enlarged to include different inertial 
frames, moving with different velocities. 

The obvious way to go further is the find analogs in spacetime for the 3-vector, Ar, and for the 
dot product. The analog of the 3-vector is pretty obvious. It is a 4-vector, with 4 components, the 
first of which (which we will call the t component, just to remind us that it is special) is the time. 
So a 4-vector is a quartet of numbers, 


A — ( A t , A x , A y , A z ) (29) 

where A t is refered to as the time component, and A x , A y and A z are the space components, which 
are the three components of a 3-vector. We will sometimes write the 4-vector as 

A = (A t , A) (30) 

recognizing that the last three components form an ordinary 3-vector. 

Example: The difference between the components of two spacetime events forms a 4-vector 

Ar = (At, Ax, Ay, Az) = (At, Ar) (31) 
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I may sometimes refer to the four components of a 4-vector using different subscripts, 

(Aq, Ai, A 2 , As) <-> (At, A x , A y , A z ) (32) 

I will try not to use this notation, but I may sometimes slip. So just remind me if I do, and remember 
that these are simply two different notations for describing the same object. 

Not any quartet of coordinates is a 4-vector. What makes a 4-vector a 4-vector is that it behaves 
like the coordinate interval, (31), under a change from one inertial frame to another, that is under 
a Lorentz transformation. Thus if A is a 4-vector, then under a Lorentz transformation to a frame 
moving with speed v in the +x direction, the components of A go to a new set of components A' 
related to the first by the Lorentz transformation 
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describes a rotation by an angle 9 about z axis for all possible vectors! This is at the heart of why 
the idea of vectors is so important. 

We have now seen two things that behave like 4-vectors - the space-time interval and the 
energy-momentum. You will read about others in Chapter 12 of Dave Morin’s book. 

Now for the analog of the dot product. If we have two 4-vectors, A, from (29), and B , 

B = (B t , B x , B y , B z ) (38) 

then the combination 

A- B = A t B t - A-B (39) 

has the same value in any frame of reference (just as the ordinary dot product has the same value 
in any coordinate system). The notation here is a bit condensed. If you see a dot product between 
things that don’t have vector indices (or bold face in David Morin’s book), you should assume that 
the things are 4-vectors and that the dot product is the 4-dimensional invariant product defined by 
(39). You can check that a Lorentz transformation to a reference frame moving the in x direction 
does not change the value of (39). It is also obviously unchanged by rotations because the space 
vectors enter only through the ordinary dot product, which is unchanged by rotations. Note also 
that if we set B = A, we recover the equation for the invariant interval, s 2 , and therefore we can 
write the invariant interval, (28), as 

s 2 = Ar ■ Ar (40) 

just as for 3-dimensional vectors, the square of the distance between two vectors is a dot product, 

g 2 = A r ■ A r (41) 

Right now, 4-vectors and the invariant product probably look like just a pretty mathematical 
analogy. But we will see shortly when we talk about using the constraints of energy and momentum 
conservation that they are an indispensible part of our tool box for dealing with the relativistic 
world. 

Energy, momentum, velocity and mass 

There are several ways of writing the relation between energy, momentum, velocity and mass. The 
one that we started with, 



is actually not the most useful. It doesn’t make sense for massless particles, such as the particles 
of light itself, because both the numerator and the denominators vanish as m —> 0. However, 
we can combine these into two relations that are even more general, and make sense for any m. 
First consider the obvious one — form the invariant product of the energy-momentum with itself. 
Explicit calculation gives 
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As expected, the dependence on v has gone away, because the invariant on the left hand side 
does not depend on the inertial frame, and thus cannot depend on how fast the particle is moving 
(because the speed changes when we go from one frame to another). 

We can also get a relation that makes sense as m —» 0 by dividing the expression for momentum 
by the expression for energy, 



(44) 


These two relations are the most general formulation of the relations among energy, momentum, 
velocity and mass. I put boxes around them because they are very very very important. When 
combined with the invariant scalar product, these relations are incredibly powerful. 


Massless particles 

When m = 0, (43) and (44) are perfectly sensible, but the result is a bit odd. For m = 0, (43) 
implies that 

|p I = E (45) 

Then (44) implies that 

v = p/\p\ = p (46) 

which means that a massless particle always travels at the speed of light. If you think about this for 
a minute, it really makes your head hurt. For one thing, it means that there is no way that the state 
of such a particle can be specified only by its position, because massless particles with different 
energies move at the same speed, and thus cannot be distinguished by their speed. Thus unlike 
the classical particles you are used to, massless particle with different energy can have exactly the 
same trajectories. The resolution of this puzzle classically is to say that one shouldn’t talk about 
massless particles at all, but just about classical waves like the electromagnetic waves you will 
leam more about in Physics 15b. But quantum mechanically, these particles really exist. In fact, 
what Einstein got the Nobel prize for was not relativity, but for his explanation of how light could 
knock electrons out of a metal (the photoelectric effect) because a light wave of frequency v can 
be regarded of consisting of massless particles each with energy hv where h is Planck’s constant. 

There is a related issue that sometimes causes confusion. Some of you have probably seen 
relativity before, and you may have been exposed to the rather idiotic notion of a “rest mass” that 
is the actual mass of the particle and a “relativistic mass” that depends on velocity. This is not 
useful! If you ask a physicist what the mass of the electron is, the response will be 

m e « 9.11 x 10“ 28 g (47) 

or 

m e sa 0.511 MeV (48) 

The response will certainly not be “Do you mean the rest mass of the electron?” or “How fast is 
your electron moving?” 

And of course, this doesn’t make any sense at all for massless particles like the photon. 
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Not only do these silly notions of “rest mass” and “relativistic mass” not correspond to the way 
physicists actually talk about these things, but the motivation for them (such as it is) is philosophi¬ 
cally flawed. I think that the idea was to preserve the form of the equation 

F = ma (49) 


for large velocities. There are two problems with this. One is that it doesn’t work, even if you allow 
m to depend on v. You can still only preserve this form in certain special cases. But more impor¬ 
tantly, you shouldn’t want to preserve this form anyway. We have seen that the more fundamental 
form that arises naturally in a Lagrangian description of mechanics is 

F = ijf (50) 


We will see next week that this relation survives intact in relativistic physics. No silly definitions 
are required. 

So if you are used to using the term “rest mass” and “relativistic mass,” you should try to get 
over this as soon as possible. They will only cause you grief and confusion in this course and 
beyond. If all else fails, perhaps hypnosis might help. 

As it happens, I recently received an email from the great Russion physicist Lev Okun, asking 
me to comment on his paper “The mass versus relativistic and rest masses” in which he discusses 
the history of this issue. His argument for many years is that the notion of the “relativistic mass” 
m(v) obscures the underlying symmetry — the 4-vector nature of ( E,p ). In the fundamental 
equation 



(51) 


m is constructed using the invariant product of the 4-vector ( E . p) with itself. Thus the right hand 
side of the equation is an invariant quantity that has the same value in all reference frames. In 
4-d as in 3-d, we call such a quantity a scalar. Mass is a scalar and is not conserved. Energy is a 
component of a 4-vector and is conserved. Don’t confuse them! 

I have put Lev’s paper up with the lecture this week. If any of you have comments on the paper 
or suggested improvements, please let me know and I will pass them along to Lev. 
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lecture 13 

Topics: 

Where are we now? 

Particle collisions 
K + decay 
Neutrino scattering 
Minimizing and maximizing 
p decay 
Colliders 


Where are we now? 


Last time, we introduced the notion of 4-vectors and in particular, the 4-vector of energy and 
momentum that is conserved if the laws of physics are invariant under Lorentz transformations and 
translations in time and space. For free massive particles, these have the form 


E = 


m 

\/l — v 2 


_ mv 

p= VT^ 


(i) 


But I suggested that more useful and general way to think about energy and momentum of a free 
particle is think of the four quantities 


energy 

E 

momentum 

P 

mass 

m 

velocity 

V 


( 2 ) 


as related by the two relations 



(3) 


and 

v = p/E 


(4) 


From (3 ) and (4), you can derive (1) if the mass is not zero. But (3) and (4 ) are still true even when 
the mass is zero and (1) doesn’t make sense. 

In addition, we discussed the invariant product of 4-vectors, which has the same value in all 
inertial frames. 

Today, I now want to work through a bunch of examples of how these are useful. Most of these 
examples are taken from my own field of elementary particle physics, because particle physicists 
live with relativity every day. 
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Particle collisions 


The relativistic energy and momentum that we derived last time are incredibly important. We 
derived these expressions by thinking about single free particles, but they are much more generally 
useful. The reason is that in almost all interesting situations, we can think of particles as free 
most of the time, except when they are actually colliding with one another. Then most of the 
time, the energy and momentum is just given by the sum of the energies and momentum of the 
particles. In a collision, the individual energies and momenta will change, but the total energy and 
momentum will be the same before and after the collision, even when new particles are created or 
when particles initially present are annihilated. 

This is conservation of energy and momentum. Conservation means simply that when we add 
up the energies and momenta of the particles in the initial state of some scattering process the result 
is the same as if we add up the energies and momenta of the particles in the final state. The thing 
that I want to try to convince you of today is that it is much easier to determine the constraints 
that come from energy and momentum conservation if we think of the energy and momentum as a 
4-vector, and use the fact that the invariant product of 4-vectors is independent of the frame. 

Today, I want to do a lot of examples of the use of conservation of the energy-momentum 4- 
vector to analyze the decay, scattering, and production of particles. There is a very simple general 
idea that underlies all of these problems. The idea is to get rid of things that you don’t know by 
using the relation E 2 — p 2 = m 2 . Let’s jump right to examples. 

K + decay 

There is a particle called the K + (pronounced “kay plus”). It is called a “strange” particle for 
historical reasons. This is not because it is peculiar (at least it doesn’t seem peculiar any more, 
now that we know what it is), but because it carries a property called “strangeness”. Anyway, 
it decays rather quickly into a pair of pions. Pions are the lightest of the particles made out of 
quarks and antiquarks (generically called hadrons) so they show up often. The I\ ' can decay into 
one neutral pion (called 7 r° — “pi zero”), which has a mass of about m n 0 « 135 MeV and one 
charged pion (called 7r + — “pi plus”), which has a mass of about m n + ~ 140 MeV. The K + has 
a mass of rriK+ ~ 494 MeV. Now suppose that the decay of the K + occurs at rest. What are 
the energies of the two pions? This is a typical sort of question in what might be called decay 
kinematics. To answer such questions, we think about 4-vectors and use conservation of energy 
and momentum. Let us call the energy-momentum 4-vectors K for the K and 7r + and 7 r 0 for 
the 7r + and 7r° respectively. Conservation of energy and momentum is the statement that the 4- 
dimensional vectors satisfy 

K = 7T+ + 7 r 0 (5) 

This is a short-hand for four equations, conservation of energy and conservation of each of the 
three components of momentum. Note that it is true in any frame of reference. In the rest frame, 
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the 4-vectors look like 

K = (m K , 0) 

n+ = {E + ,p + ) (6) 

7T 0 = (E 0 ,p 0 ) 

We have used a standard trick here — one that you should be familiar with from our rules of 
coherence. If you don’t know something, give it a name! Now (5) can be used to say things about 
(6), for example, p + = —p 0 . But let us try to resist this temptation. In problems l ik e this, it is 
often convenient to manipulate the 4-vectors symbolically for a while before actually doing the 
calculation. The idea of such manipulations is to be able to use the invariant product to compute 
what you want to know without having to calculate what we don’t care about. Here for example, 
suppose that we first want to calculate the energy of the 7T + , which we have called E + . If we could 
calculate the value of the invariant product K ■ 7r + = thk E + , that would immediately give us E + . 
So suppose that we rewrite (5) as 

K - 7T+ = 7T 0 (7) 

Now if we take the invariant product of each side of this equality with itself, we will get terms 
involving K ■ 7r + = m k E + : 


(.K — 7T_|_) • {K — 7T_|_) = K • K — 2 K • 7T_|_ + 7T + • 7T+ 
= m 2 K — 2 K ■ 7T+ + m 2 + = 7r 0 ■ 7r 0 = m^ 0 


( 8 ) 


Now we can solve this for K ■ 7r + 


K ■ 7T+ 


m 2 K + m 2 + - m 2 0 
2 


( 9 ) 


or 



+ mi + — rriio 

_ _ 7T~ _ 7T U 

2 m,K 


( 10 ) 


Easy, no? Now we can get E 0 either by repeating the same calculation with + and 0 interchanged, 
or by using energy conservation. The result is 


E 0 = 


m 2 K + ml o - ml+ 
2 rriK 


(ID 


If you were asked to do so, you could now go on and calculate the magnitudes of the momenta 
of the pions by using E 2 — p 2 = m 2 . From there, you could calculate the speeds, although this is 
seldom very interesting in such collisions. You cannot calculate the direction of the momentum or 
velocity, because this is actually quite random. The K + is a particle with no angular momentum. 
When it is at rest, there is no vector associated with it. And therefore there is no direction picked 
out for its decay products. They go off at random with equal probability in all directions. That is 
quantum mechanics. God plays dice with the universe. 
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Neutrino scattering 

Here is an example of an interesting scattering process. Neutrinos are very light particles. Until 
recently, we thought that they might be massless, like photons. But it now appears that the neutrinos 
have tiny masses. Furthermore, these masses are very peculiar. There are three different kinds 
of neutrinos: v e (an “electron neutrino”); z/ M (a “mu neutrino”); and v T (a “tau neutrino”). The 
names refer to the processes in which these neutrinos are produced, which involve respectively the 
electron and the heavier versions of the electron, the /i (“mu”) and the r (“tau”). The tiny masses 
do not respect these distinctions, and they produce bizarre quantum mechanical mixing between 
these different types of neutrinos. But if it were not for these weird quantum mechanical effects, 
we could ignore the neutrino masses altogether. So that is what we will do in this course. We 
will simply pretend that neutrinos are massless, which is an excellent approximation for the sort of 
questions that we can ask and answer in this course. 

In spite of the fact that neutrinos have no electric charge and almost no mass, it is possible to 
make beams of neutrinos. In fact, one of my colleagues, Gary Feldman, is part of a large project 
that involves making a neutrino beam at Fermilab outside Chicago and aiming it at a large detector 
in an underground mine in northern Minnesota. This should tell us more about the peculiar neutrino 
masses I mentioned earlier. 



But here, I just want to note that with these beams, we can observe bizarre processes such as 
the scattering of a u IJ; with energy E from an electron at rest to produce a final state consisting of a 
u e and a /i. The /i has a mass m /t about 207 times the mass of the electron, m e . 

Now, a question that you might ask about this process is the following. Suppose that you see a 
v e in the final state flying off at an angle & from the initial direction of the u IL : as shown: 

Initial state final state (^) 

V e 

•-► • «- 

U I' e \ e 

What does energy and momentum conservation tell you about the energy of the v e in the final 
state? 
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The 4-vectors look like 

e = (m e , 0,0,0) 

= ( E ,E, 0 , 0 ) 

u e = (Ei, Ei cos 0, Ei sin </>, 0) 


( 13 ) 


fi = (E 2 , P 2 cos 6, -p 2 sin 6 , 0) 


Some comments about this are in order. I have chosen to put the initial momentum along the 
x axis. That is no problem, because I can rotate my coordinate system to make it so. Likewise, 
I have assumed that the scattering takes place in the x-y plane, which I can again do by just 
rotating the coordinate system. I have put in the information that the neutrinos are massless by 
taking the lengths of their momentum vectors to equal their energies. If I have not done this and 
just given the momenta names, we would have quickly gotten to this point when we imposed 
E 2 — p 2 = m 2 = 0 on these 4-momenta. I have also, in my head, imposed a little bit of energy 
momentum conservation by writing // in the x-y plane (although we won’t use this at all). Again, 
if we had put in a 2 component for / 1 , we would have quickly realized that it must be zero because 
all the other 4-vectors have zero z component by construction. 

Now, we could simply impose energy and momentum conservation of the rest of (13), and 
try to solve the equations. But it is better to think. For example, we can easily find E \, because 
energy-momentum conservation 

+ e = n + v e (14) 


implies 


Vp. + e — v e = n 


(15) 


This is a good way to write things, because when we take the invariant product of each side with 
itself, all the nonsense in /i (which we don’t know yet), drops out, and we can write 


(z/ M + e - v e ) ■ (i^ + e - v e ) = p ■ n = 

The left hand side of (16 ) is 


(16) 


^ + e ■ e + v e ■ v e + 2 ■ e - 2 ■ v e - 2 e • v e 


(17) 


which with (16) implies 


ml + 2 m e E — 2 E -Ej(l — cos0) — 2m e Ei = m ^ 


(18) 


so that 


?7ig + 2 m e E — m 2 
2 E (1 — cos 4>) — 2 m e 


(19) 


Practically speaking, this is a bit of a swindle. There is nothing wrong with the calculation 
above, but a particle physicist would never ask you to calculate things in terms of the angle of the 
final state neutrino. This is because neutrinos are very hard to see. They very seldom interact with 
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anything. That is how they will manage to get from Chicago to northern Minnesota. But there 
are many ways of making the direction of charged particle track show up, all making use of the 
electric charge and its interactions. So it would make more sense physically to ask you to find 
things in terms of the // angle, 9, or its energy, E 2 , or momentum, p 2 . This is a little more involved 
algebraically, but the principle is the same. 


Minimizing and maximizing 


There is an interesting and useful class of questions in which the kinematics does not completely 
fix the interesting quantities, and you have to think about how to make them bigger or smaller. 
Here is a typical situation. Suppose that a particle with mass m\ and energy E collides with a 
particle with mass m 2 and the collision produces a final state with n particles with masses pi, p 2 , 
... p n . What is the minimum energy E required for a process that produces these particles in the 
final state to take place? And what is the total energy of the process in the zero momentum frame? 

First let me write down the answer, which is rather simple. The total energy of the process in 
the zero momentum frame is at least 

n 

E P=0 = X! P? = M min (20) 

3 = 1 


and the energy E of the particle with mass mi in the frame in which particle 2 is as rest is 


M min - m l 


rm 


2/719 


( 21 ) 


Let us begin the proof by discussing the question in general. The key to problems like this is 
to treat the whole final state as a single entity. The final state, whatever it is, will have some total 
energy-momentum 4-vector 

n 

T = (E T ,P T )=Y.( E hPi) ( 22 ) 

j-i 

Then we can define a “total mass” Mt for the final state using the fundamental relation between 
energy, momentum and mass — 

El - P T 2 = Ml (23) 

Then in terms of Mt, this problem is formally equivalent to the problem of producing a particle 
with mass Mt by colliding a particle with mass m\ and energy E with a mass m 2 particle at rest. 
Define the 4-momenta as 

Pi = (E,p) P 2 — (m 2 ,0) (24) 

Then 4-momentum conservation implies 


P 1 +P 2 = T 


(25) 


Taking the invariant product of each side gives 

Pi ■ Pi + 2 Pi ■ P 2 + P 2 • P 2 — m 2 T* 2?7i2 E T m 2 = M't (26) 
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( 27 ) 


Mf. — — rri'2 

E = - 

2rri2 

which should remind you of (21). Evidently, to minimize E we need to minimize Mr, and if we 
find that the minimum value of M T is the M min in (20), then we are done. So how do we do that? 

The advantage of considering M T is that it is an invariant quantity. It has the same value in 
all frames of reference. In particular, it looks the same in the zero momentum frame. Thus in this 
frame, the total energy is just Mr 

n 

T 1 = (E' T ,P T ) = £ (£' 4 )' = (E' t , 0) = (M T , 0) (28) 

3 = 1 

But (28) implies that 

n 

M T = Y E' (29) 

3 =1 

Then since the energy of a particle is always greater than or equal to its mass, (29) implies 

n 

Mr > Y ^3 = -Mr ni n (30) 

3=1 


If the particles in the final state are all massive — that is if they have /j,j > 0 for all j, then the 
bound (30) is saturated when all the particles are at rest in the zero momentum frame and this is 
the best we can do in minimizing M T . 

If there are massless particles, we cannot take their momenta to be zero, but we can take them 
to be very very small. In this way, we can come arbitrarily close to the theoretical minimum. 
If we now boost this final state by a Lorentz transformation, all of the massive particles will be 
traveling with the same velocity, and the massless particles will still have arbitrarily small energy 
and momentum. This is the best we can do. 

Here is a simple (and classic) example. Suppose that you want to make antiprotons. You can 
do this by hitting a proton at rest with a proton of energy E to produce a final state consisting of 
three protons, each with mass m p , and one antiproton (also with mass rn p ). What is the minimum 
energy required to make antiprotons this way? I should say that this is actually the way that 
antiprotons are made at Fermilab and CERN (where for example Professor Gabrielse uses them to 
make antihydrogen and study its properties). Because all of the final state particles are massive, 
we can take all four particles to be moving with the same velocity, v, and the total mass will then 
be M T = 4m p . Putting this into (27) gives the result: 


E 


Ml - 2 m 2 p 
2 m p 


16 m 2 p — 2 m 2 p 
2 m p 


7m p 


(31) 


Note we have obtained a little bit more than the answer we were looking for. We actually know 
something about the final state of the four particles when (31) is satisfied — that all the particles 
are moving with the same velocity. 

To think about this another way, imagine slowly cranking up the energy of our proton beam 
until we start to produce antiprotons. When this first happens, at an energy given by (31), the four 
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particles have the same velocities, but as we go to higher energy, the velocities are the same and 
some of the energy of our original beam is wasted in producing the extra energy of that relative 
motion, rather than just going into making antiprotons. 

This result is sort of neat, but the point is the style of argument. The idea is to think of whole 
collections of particles as having a “mass” — computed from the total energy momentum 4-vector. 

/! decay 

We have already talked about the fact that the heavy version of the electron called the muon, /i, is 
unstable. It decays into an electron, a muon neutrino, and an electron antineutrino: 

-> e~ + z/ M + u e (32) 

This process is actually very closely related to the inverse of the scattering process we discussed 
earlier, 

v e + pT —>• v, t + e~ (33) 

There is a sense in which the v e in (32) is related to a v e traveling backward in time. That is to say 
that a v e in the final state is related to a v e in the initial state. If we made this change in (32), we 
would get just the inverse of the process (33). In some sense, this is why antiparticles must exist 
for every type of particle. This is actually related to a discrete symmetry called CPT which stands 
for 


Charge Conjugation-Parity-Time reversal 

which might be an exact symmetry of the world. 

At any rate, we can ask the usual sort of questions about this. Here is a modest list. Suppose 
that the fi decays while it is at rest. 

1. What is the minimum possible energy of the electron? 

2. What is the maximum possible energy of the electron? 

3. What is the minimum possible energy of one of the neutrinos? I really mean neutrino or 
antineutrino here, but it takes too long to say that each time, and they are both nearly massless 
anyway, so I won’t bother. 

4. What is the maximum possible energy of one of the neutrinos? 

5. Consider the total energy and momentum of the two neutrinos as a 4-vector. What is the 
maximum possible value of the invariant product of this 4-vector with itself, E 2 — p 2 l Or 
equivalently, what is the maximum “mass” of the two neutrino system? 

6. What is the minimum “mass” of the two neutrino system? 

The idea of all these can be found by thinking about the analysis we just did for the mass of a 
system of particles. Let’s take them one at a time. 


8 


1. What is the minimum possible energy of the electron? The smallest energy the electron 
could possibly have is m e , which it would have if it were at rest. Is this possible? Sure! We 
can conserve energy and momentum if all the rest of the energy goes into two back-to-back 
neutrinos, so the 4-momenta would look like 

fi = (rri/j, 0) e = (m e , 0) = (E, Ev) v e = (E, -Ev ) (34) 

Conservation of energy and momentum works if E — (m M — m e )/ 2. 

2. What is the maximum possible energy of the electron? To get the maximum possible 
energy for the electron, what we want is that the effective mass of the rest of the stuff in 
the decay, the two neutrinos, should be as small as possible. But if the two neutrinos have 
parallel momenta, the mass of the two-neutrino system is zero. That is the best we can do. 
The process then looks l ik e 

y = (m /t , 0) e = (E,pv) v, x = (xp, -xpv) v e = (yp, - ypv) (35) 

where x + y — 1. We can calculate E easily as we have done in other problems if we note 
that y — e has mass 0, so that 

yji ^ TT1 ^ 

ml - 2 m.E + ml = 0 =*► E = 11 -- (36) 

M 2 

3. What is the minimum possible energy of one of the neutrinos? Either of the neutrinos 
can have arbitrarily small energy and momentum. There is enough freedom to satisfy energy 
and momentum conservation with the other two carrying the load. 

4. What is the maximum possible energy of one of the neutrinos? This is actually related 
to the previous question. The maximum neutrino energy arises when the neutrino recoils 
against the minimum possible mass, which is the electron mass, with the other neutrino 
carrying negligible energy and momentum: 

y = (m M , 0) e = (E,pv) i/„ = (p, -pv) v e = («(),« 0) (37) 

Now the mass of the 4-vector y — is y e , so 

o o — ml 

ml — 2 m ^p = m 2 e p — —^- (38) 

5. What is the maximum “mass” of the two neutrino system? We have really done this one 
already. The two-neutrino system has its maximum mass when it and the electron are both 
at rest, and the mass of the two-neutrino system is m /t — m e . 

6. What is the minimum “mass” of the two neutrino system? We’ve done this one also. If 
the two neutrino momenta are parallel, the mass is zero 
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Colliders 


Last time, we looked at the process 

J/ip —> e“ + e + (39) 

where e~ is an electron, e + is a positron, the antiparticle of the electron, and J/ip is a particle that 
has two names for historical reasons. It was discovered allegedly independently at Brookhaven 
and at SLAC. Anyway, it has a mass of mj ~ 3097 MeV. 

e~ + e + —> J/ip (40) 

Now suppose that we let positrons with energy E collide with electrons at rest to produce J/ips. 
What energy is required? The 4-vectors are as follows: 

e~ = (m e , 0) e + = (E,p) J/ip = (E',p') (41) 


Now we can use energy-momentum conservation 

e~ + e + = J/'ip 

Taking the invariant product of each side with itself gives 


nil + 2 m e E + ml = m 2 


so that 


E = 


m 


JN> 


— 2ml 


2 m P 


J/i’ 


9385 GeV 


(42) 


(43) 


(44) 


This is a huge energy scale, beyond what is presently available at accelerators. The problem is that 
the electron is very light. A collision between a very high energy electron and an electron at rest 
is a bit like a nonrelativistic collision between a moving truck and a feather. Very little energy is 
actually transfered in such a collision. 

However, it is much easier to produce the J/ip in a collider, in which an electron and positron 
collide with equal and opposite velocities and momenta. In this case, the 4-vectors look like 


e ={E,p) e + = (E,-p] 

Here, energy-momentum conservation implies 


J/ip = (mj/^,0) 


(45) 


E = mj/^/2 « 1.55 GeV (46) 

Colliding beams, in this case, make a huge difference. The difference is that in the collision with 
a particle at rest, much of the energy of the incoming particle is wasted producing kinetic energy 
of the collision products. This effect exists in Newtonian physics also, but it is much worse at high 
energies where relativistic physics takes over. Note that the problem is particularly bad for the 
electron, because it is the lightest particle with electric charge. As (44) shows, the energy required 
to produce a heavy particle of mass M in a fixed target collision between particles of mass m, much 
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less than M is inversely proportional to m, so the lightness of the electron is a terrible problem. 
But if you want to have the capability to produce the heaviest possible things, colliding beams are 
essential no matter what you are colliding. 

Why is this interesting? What are these heavy particles that particle physicists make, and why 
would you want to make them? The first thing to say is that we don’t need heavy particles to 
make heavy things. The things in the universe that are much heavier than protons and neutrons 
and electrons - from molecules to galaxies - are made by putting many copies of these light things 
together. This can be done in a practically infinite number of way and produces lots of interesting 
physics. But the heavy particles of particle physics are very different. They are not just light things 
put together in interesting ways. And there are only a very few of them, with completely well 
defined masses and properties. To make one of these particles with mass M, we must not only 
have enough energy, M, in the zero momentum frame, but we must also concentrate that energy 
in a tiny region of space and time, of size 1/M (in particle physics units). They are genuinely new 
indivisable degrees of freedom that appear only when look at the world at large energies and small 
distances. They are not useful, unless you want to know how the world works. 
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lecture 14 


Topics: 

Where are we now? 

Causality and rigid bodies 

Forces 

Relativistic strings 

Color force 

A relativistic oscillator 

The moving oscillator 

Where are we now? 

So far we have been discussing mostly the kinematics of special relativity. This week we will 
talk a bit about dynamics. I will talk about forces and the work-energy theorem in relativity in 
general. Then I will introduce the rather bizarre idea of relativistic strings. I have several reasons 
for wanting to do this. The first is that relativistic strings will allow us to do a completely honest 
analysis of a relativity “paradox” similar to some you have read about in David Morin’s notes. I 
have avoided these until now. Most of them seem more paradoxical than they really are because 
we are used to thinking of large objects as rigid. This makes no sense in relativity, and we will start 
today by reviewing that argument. But with relativistic strings, we will be able to build objects 
that get large, but can still be analyzed in a way completely consistent with special relativity. The 
second reason for discussing relativistic strings is that they are an incredibly hot topic in theoretical 
physics and mathematics these days. Needless to say, we won’t get very far along this path. But 
I thought that you might enjoy seeing just a tiny bit of it. Believe it or not, the third reason for 
introducing relativistic strings is that if you let youself go and really imagine that this stuff exists, 
it will actually help you think about forces in relativity. In particular you can use it as a mnemonic 
to help you remember how to Lorentz transform force. 

Causality and rigid bodies 

Some of our intuition about the way things work depends on the rigidity of solids. This leads to 
some of the paradoxes in relativity, because the notion of a rigid body makes absolutely no sense 
in relativity. This is a fun issue because it is related to the philosophical issue of causality — cause 
and effect — and is closely tied to the fact that information cannot travel faster than light. 1 The 
point is this. Two events that are separated by a space-like interval cannot have any effect on one 
another if we accept the principal that effects must come after their causes. For any two space-like 
separated events, there exist frames of reference in which either event comes first. We have already 
seen that given two space-like separated events, there exists a frame in which the two events occur 
at the same time. From this frame, we can look at frames which are moving in the direction of the 

1 We will return to this important principle several time in various contexts. 
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space separation between the two events. Then in a frame moving towards one event, that event 
must come first. You can see this directly from the Lorentz transform — it is related to the minus 
sign in (for a separation in the z direction) 

At' = - 7 ==(At - v Az) (1) 

V1 — v 2 

Or we can see it qualitatively from the classic argument for relativity of simultanaeity in which we 
send light pulses from the center to the two events. 

Now the problem with the idea of a rigid body is this. If you push on one end of a rigid body, the 
whole body starts to accelerate. This is impossible, because the two events that mark the beginning 
of acceleration at the two ends of the rigid body are separated by a space-like interval. In some 
frames, the beginning of acceleration of the far end of the rigid body comes before the beginning 
of acceleration where you push. So your push cannot cause the acceleration at the far end. And 
these are physical events. Both the push that causes the acceleration and the acceleration at the 
other end involve non-zero forces that are not just artifacts of a particular inertial frame. Thus rigid 
bodies are impossible unless effects can happen before their causes — which is not a good idea. 

This is closely related to the fact that information cannot travel faster than light. We have just 
argued that an effect and its cause cannot be separated by a space-like interval. They must therefore 
be separated by light-like or time-like interval. If the interval is time-like, a massive particle can 
travel at a velocity less than the speed of light from the spacetime coordinates of the effect (the 
“effect event” if you like) and the spacetime coordinates of the cause (the “cause event”), because 
these two events can be two ticks of the same clock. If the interval is light-like, light can travel 
from the cause event to the effect event. But these are the only possibilities. If the information 
travels from the cause event to the effect event at a speed greater than the speed of light, that means 
that the distance L between the cause and the effect is greater than c times the time difference T 
between the two events, which means that c 2 T 2 _ L 2 < 

0 and the interval is spacelike. Thus the 
far end of the body cannot possibly know that the body has been pushed any sooner than it takes 
light to travel from the push to the other end. Otherwise there would be a frame in which the effect 
happens before the cause. 

What is really going on here is this. Even in non-relativistic mechanics, the idea of a rigid body 
is an idealization. No real body is perfectly rigid. In fact, when you push on one end of a real 
object, compression waves travel through the object to transmit the information to all the parts. 
These waves travel at a characteristic speed (like “sound waves”) that depends on how heavy and 
how stiff the body’s material is. The stiffness is like the spring constant of a spring, and the stiffer 
the material, the faster the waves. But in non-relativistic physics, there is no reason why you 
cannot imagine a material that is arbitrarily stiff, so that the speed of these waves goes to infinity 
and in the limit, the body becomes truly rigid. In relativity, this is impossible, because these waves 
carry information, and thus cannot travel faster than light. 2 So the most accurate statement is that 

2 You will learn in Physics 15c, if you have not already, that it is really a bit complicated. What cannot be faster than 
light is actually the group velocity, which is not necessarily the same as the velocity at which the wave crests move, 
which is called the phase velocity. But it is the group velocity that describes how fast information can be transmitted. 
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relativity makes it impossible to build arbitrarily stiff material. Even the forces that hold matter 
together are built on the principle that information cannot travel faster than light. 

The notion that information cannot travel faster than light will be very important when we talk 
about the early universe, which I plan to do at the end of the course. 


Forces 


As we discussed when we talked about the Lagrangian for a massive relativistic particle, we can 
add additional terms that describe forces, and then as usual, the force on the particle will be the 
rate of change of the momentum. It is important to note that because the energy and momentum 
in special relativity are derivable from a Lagrangian, we can use the same relation between force 
on a particle, the particle’s momentum and the power transmitted to it by the force in relativistic 
physics that we do in nonrelativistic physics, 


F 


v ■ F 


dp 

dt 

dE 

dt 


( 2 ) 

(3) 


The first, (2), can be regarded as a definition of what we mean by force. It is just a rewriting of 
the Euler-Lagrange equation. The second, (3), which says that the dot product of the force and 
the velocity is the power supplied to the system, then follows from the definition of E. It is also 
consistent with the fundamental relations between energy, momentum, velocity and mass, 


E 2 = p 2 + m 2 v = p/E 


(4) 


We can see this consistency explicitly if we differentiate both sides of (4) with respect to t, to get 


„ dE ^ dp _ -± 

E lF = p 'di =p - F 


(5) 


or 


dE 

dt 



v ■ F 


( 6 ) 


This last can be rewritten by multiplying by dt as the relation between work done by the force and 
the change in energy. 


dE = d r ■ F 


(7) 


Again, this is not surprising. It had to work because the energy and momentum of the relativistic 
particle are derivable from a Lagrangian. 

Let’s discuss an example of all this. Suppose that a particle of mass m is subject to a constant 
force F 0 in the x direction. Lrom (2) we can immediately conclude that 


p = x F 0 t 


( 8 ) 


which is what matters in this argument. 
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Now suppose that we want to figure out the distance, x(t), the particle has traveled. We can do this 
by using (4) and (7). 


F 0 x(t)=J‘dt ld ^p- = E(t)-E( 0) 
= \Zp 2 + m 2 — rn — \J F 0 2 1 2 + m 2 — m 


( 9 ) 


Thus 


x{t) = 


\jFot 2 + m 2 


m 


( 10 ) 


This kind of trick with the work-energy relation is often useful. Study it. On your own, as practice 
with the Taylor expansion, you should verify that (10) reduces to the expected non-relativistic 
result in the appropriate limit. 


Relativistic strings 

When you pull on an elastic material object, it stretches, and stores the energy you feed into it 
in potential energy associated with the quantum mechanical electromagnetic interactions that hold 
matter together. In relativistic physics, one can imagine a different scenario. Imagine stuff that 
when you pull on it, creates more of itself. In this case, the energy is stored in the mass energy of 
the new stuff that is created. This sounds absolutely crazy. But there are good reasons to believe 
that this stuff really exists. It is called “relativistic string”. In particular, the glue that holds quarks 
and antiquarks together seems to behave like a relativistic string. Let me discuss just a bit how this 
goes. 

The simplest state involving quarks is a state of one quark and one antiquark. In many ways, 
this is like a bound state of an electron and its antiparticle, the positron, which is called positronium. 
The force that holds quarks and antiquarks together not the same force that holds the electron and 
the positron together in positronium. The force in positronium is just the Coulomb force that you 
are familiar with between ordinary charged particles. The force between quark and antiquark is 
much stronger and has other peculiar properties, as we will see. Nevertheless, both forces can 
be described in terms of fields and “field lines”. Let me warm up by reminding you about our 
discussion of field lines for the ordinary electric force. 

Field lines begin at positive charges and end at negative charges, but otherwise are continuous. 
The direction of the field lines at any point in space indicates the direction of the electric field 
at that point. And the electric field is the force per unit charge on a charge placed at the point 
in question. The density of the field lines indicates the magnitude of the electric field. So, for 
example, a single positively charged particle at rest has field lines spreading out from the positive 
charge in all directions, uniformly. The density of the fields lines drops off like 1/r 2 because the 
field lines at a distance r from the charge are spread out over a sphere of radius r. Thus the electric 
field drops off like 1/r 2 , which in turn is the reason for the 1/r 2 behavior of the Coulomb force. 
Notice that this depends on being in 3 space dimensions. If I draw a two dimensional representation 
of the field lines, it will give the general idea, but will be quantitatively wrong because the lines 
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only spread out over the surface of a circle as they go out. Nevertheless, these two dimensional 
pictures can help us to visualize what is happening with the field line, like so 



In positronium, it is the electric force associated with these field lines that holds the system 
together. The electron and positron have opposite charges, so the force is attractive. When the 
electron and the positron are pulled apart, the force decreases like 1/r 2 . 

Color force 

In the quark-antiquark system, the force is in some ways similar to the electric force — we call 
it a “color” force. The term “color” is rather whimsical, having nothing to do with real colors. 3 
Very close to the quark and antiquark, there is a color field that spreads out from the charges like 
the electric field from a positron or electron. But for reasons that are complicated and only partly 
understood, at distances far from the quark and antiquark, the color electric field lines organize 
themselves into a tube, as shown below: 



( 12 ) 


This tube behaves approximately like a relativistic string (the difference is just the end effects 
near the quark and antiquark, and we will always ignore these). When I pull the quarks farther 
apart, the tube just gets longer. The energy is stored in the color electric field of the string, but that 
just gives some fixed energy per unit length because the tube looks the same everywhere except at 
its ends, as illustrated below: 



(13) 


The extra energy in the stretched string is xT where T is the energy per unit length of the string. 
Because the energy of the string increases linearly with length, that means that the string exerts a 

3 It is actually based on an analogy with the way the primary colors can be combined to produce “colorless” white 
light, but this won’t be important for what I am going to tell you. 
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constant force T at its ends. Thus the energy per unit length of the string in its rest frame, T, is 
called the “string tension”. 

Relativistic strings are extremely complicated if you think about how they bend and oscillate. 
But it is not too hard to understand how they move if they are stretched in a single dimension, 
and if they have particles (like the quarks) stuck to their ends. This is the only thing we will do 
in this course. In this case, we can just apply what we know about force and rate of change of 
momentum to understand what is going on. The situation is easiest to understand if the particles 
are only moving in the dimension in which the string is stretched, so the string has no transverse 
motion. The system then looks like this (let’s assume that everything is in the x direction and from 
here on, we will forget about any width to the string and just draw it as a fat line): 


(E2,P2) 


{El, Pi) 


(14) 


X2 


Xi 


V 2 = P2/E2 


V\ = Pi /El 


where the momenta and velocities all refer only to the x component. Note that each of the particles 
may be moving in either the plus or minus x direction. 

Relativistic strings have an absolutely bizarre property. You cannot tell whether they are mov¬ 
ing in the direction in which they are stretched. That is because they are completely featureless. 
There is nothing to show you that they are moving. You may be able to see this for the string 
we discussed above made out of a tube of field lines. If the system is moving, the field lines are 
contracted, but as long as there is no transverse motion, the lines don’t get closer together, and 
therefore the density doesn’t change. But it is the density that determines the strength of the field, 
so that means that the field doesn’t change and the energy per unit length is the same. 

So in (14), you need not think of the string as moving where it is attached to the particles at 
the ends. Rather, you can just as well think of the string as just sitting there and getting created or 
eaten up as the particles move. It doesn’t matter which picture you use. These two very different 
sounding picture are actually physically equivalent! 

Because the string is being created and eaten up as the particles move, there is a force on each 
of the particles equal to the string tension, T, which is the mass per unit length of the string. Thus 


dpi 

dt 


dp2 

dt 


(15) 


where the signs work this way because the string is always trying to contract into nothingness to 
reduce its energy, and 


dEi 

dt 


-Tvi 


dE 2 

dt 


T v 2 


(16) 


or equivalently 


dEi 


= -T 


dEo 


= T 


(17) 


dx1 dx2 

Note that the total momentum is conserved, and the total energy is conserved if the energy stored 
in the string is properly included. The energy change in (17) just comes from the string that is 
eaten up or created. 
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A relativistic oscillator 


I will now give you what I find a very amusing puzzle. Imagine that I have piece of relativistic 
string with string tension T stretched along the x axis, and that stuck to the two ends, there are 
massless particles, a quark, q, and an antiquark, q, which also move along the x axis. If the total 
momentum of the quark and antiquark is zero, this is a relativistic oscillator. Suppose that at time 
t = 0 (in the lab frame), the quark and antiquark are both at x — 0, with the quark moving in the 
+x direction with momentum n and the antiquark moving in the —x direction with momentum 
—k. The corresponding energies are both k. because these are massless particles. At this point, 
there is no string, because both particles are at the origin, so the total energy and momentum of 
this system is 

(k, kx) + (k, — kx) = (2k, 0) (18) 

Now as the two particles fly outward at the speed of light, they leave string in their wake. As 
they move they lose energy T per unit length because the string does work on the particles. The 
magnitude of the momentum is always equal to the energy because the particles travel at the speed 
of light. Thus the particles travel at the speed of light in the same direction until at t = k/T. having 
traveled a distance k/T, they have lost all their energy and momentum, and they turn around and 
start back in towards the origin at the speed of light. At the tum-around point, all their energy 
is stored in the string, and the quark and antiquark carry negligible energy and momentum. At 
t = 2 k/T, the two particles are back at the origin with the quark now moving in the —x direction 
and the antiquark in the +x direction. We assume that they can pass right through each other 
and continue on until at t = 3 k/T, they turn around again. At t = 4 k/T , the particles are back 
where they started at, at x — 0, and the whole process repeats. Thus this is an oscillator with 
period 4 k/T. This is summarized in the table below, where we describe these things in terms of 
the relevant space-time events 


events 

q 

q 

together at t — 0, x — 0 

(0,0) 

(0,0) 

turn around 

f K, hz\ 

TJ 

( K 

\T ’ TJ 

at x = 0 backwards 

(2?,o) 

(2?,0) 

turn around again 

( 3 ^, — jj 

(3^, 

back at 0 

(4f,0) 

(4$,0) 


This object is animated in STRINGY. 


(19) 


The moving oscillator 

This is a wonderful system to think about, because even though it gets big, we can understand 
exactly how it looks in different frames, because we understand all the forces that hold the system 
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together. But here is the puzzle. How can this thing possibly move? Massless particles always 
travel at the speed of light in ANY frame. When we go to another frame, for example one moving 
with velocity v in the +x direction, in which we are moving towards the oscillator, we should see 
it moving with velocity —v. But at t = 0, the two quarks are both moving out from the origin at 
the speed of light, in both frames, so the oscillator appears to be standing still! There is more string 
growing on both sides - but symmetrically, so that the center of mass remains fixed at the origin. 
What is going on????? 

As with many relativity puzzles, the resolution lies in the relativity of simultaneity. Let us 
see how this works by explicitly doing the Lorentz transformations of the events in (19) to a 
frame moving with velocity v in the +x direction. The first thing you notice is that (19) is a 
little misleading, because some of the rows refer to single events, when the quark and antiquark 
are in the same place at the origin, but others actually refer to two events, when the quark and 
antiquark turn around on opposite sides of the system. With this confusion corrected, the result 
looks like this: 


lab frame 

moving frame 

Q 

Q 

Q 

Q 

(0,0) 

(0,0) 

(0,0) 

(0,0) 

( K K,\ 

\T’ T) 


--j 

1 

HI* 

1 

HI* 



f K K, \ 

\T ’ TJ 


7 ((l + u)f,-(n + l)f) 

(2?,0) 

(2f,0) 

7 (2f,-2uf) 

7 (2f,-2uf) 


(3^, 


7 ((3-u)f,-(3u-l)f) 

(3T’ _ t ) 
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( 20 ) 


Now we see explicitly the fact that the turnaround times for the quark and antiquark, which were 
the same in the lab frame, are different in the moving frame because these events take place at 
different points in space. The simplest way to understand how this actually enables the object to 
move is to see it in the animation, STRINGY. 

We can also understand what is happening in (20) by thinking about the forces involved. At 
t — 0 in the moving frame, while there is no apparent motion of the system because the quark and 
antiquark are moving at the speed of light in opposite directions, they do have different momenta. 
The quark momentum is red-shifted, because it is moving away from observers in the moving 
frame, to 


K ■ 


1 + V 


= (1 — v) 7 K 


( 21 ) 


The antiquark momentum is blue-shifted, because it is moving towards observers in the moving 
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frame, to 


k \T^— = ( l + v)'yK (22) 

V 1 — v 

The forces on these two quarks in the moving frame are exactly the same as in the lab frame! 
The reason, as we discussed earlier, is that the relativistic string is completely featureless. Looking 
at a straight piece of relativistic string, if you cannot see the ends, you cannot tell whether it is 
moving or not. Thus the force on the particles is ±T. If you don’t believe this argument, you can 
verify that it is correct (much more laboriously) by understanding how to Lorentz transform forces. 
The result is that the component in the direction of the Lorentz transformation is unchanged, while 
the components transverse to the direction are reduced by a factor of I /7 (we will see this more 
explicitly later). The consequence of all this is that the quark in the moving frame loses all its 
energy and turns around in a time (1 — u)y k/T, in agreement with (20), while the antiquark, which 
began with more energy, doesn’t lose it all and turn around until (1 + v)j k/T. In the intervening 
time, while both quark and antiquark are moving the —x direction, the string continues to pull in 
the +x direction on the antiquark and in the —x direction on the quark, so energy is continuously 
transferred from the antiquark to the quark until the antiquark turns around. I like to call this 
system a relativistic push-me-pull-you, for obvious reasons. 

Note that the comment about the “center of mass” on page 8 was designed only to confuse you. 
Center of mass has no significance in a relativistic system. What matters is center of energy. You 
will explore this further on the problem set. 
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lecture 15 

Topics: 

Rocket motion 
Review of circular motion 
Spinning relativistic string 
Relativistic traffic 


Rocket motion 


The surprising thing (to me) about relativistic rocket motion is that there is a sense in which it 
looks just the same as nonrelativistic rocket motion - at least in the rest frame of the rocket. Non- 
relativistic rocket motion looks like this if the fuel is ejected at speed u: 



Pr = 0 

t = 0 
Pf = 0 

p r = m dv 
t = dt 
Pf = —dmu 


momentum conservation 


dp dv dm 

— — m — — —— u = Force 
dt dt dt 


( 1 ) 


Relativistic rocket motion looks like this: 


dm 


m 



E r = m 
Pr = 0 
t = 0 
Pf = 0 
Ef = dm 


u 





momentum conservation => 


E r = m 
p r = m dv 
t = dt 
Pf = u dm 
Ef = dm 
NOT drrrf u 


dp dv dm 

— — m — — -r- u = Force 
dt dt dt 


( 2 ) 


There are two compensating differences. Mass is no longer conserved in the relativistic situa¬ 
tion, so it is not correct to say that the decrease in mass of the rocket is equal to the mass of the 
fuel ejected. But it is correct to use energy conservation and say that the decrease in energy of 
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the rocket (which in the rest frame is just the decrease in mass) is equal to the energy of the fuel 
ejected. But then the momentum of the fuel ejected is not u times the mass, but rather u times the 
energy, which is the same as we got in the nonrelativistic case. 

There are lots of differences that emerge as the velocity of the rocket becomes large, but these 
are entirely the result of time dilation and the different relation in relativity between the momentum 
p of the rocket and its speed, v. 

Review of circular motion 

You have all learned in previous physics courses about uniform circular motion. I thought that it 
would be useful to review this before going on to discuss rotations in more generality. So let us 
consider an object that is moving in a circle of radius R centered at the origin with a constant speed, 
v. This is motion in a single plane (because every circle lies in some plane), so it is convenient to 
choose our coordinate system so that the motion is in the x-y plane with z — 0. We can also choose 
to have the motion in the counterclockwise direction as seen from above (by choosing which is the 
positive z direction). Then the motion of our particle can be written as 

x(t) = .R cos (cut + 0) y(t) = Rsm(ut + 0) z(t ) = 0 (3) 

where uo is the angular velocity in radians per unit time. This defines the position vector of the 
particle, r (f), in the usual way: 

r (t) = x(t) x + y(t) y + z(t) z = x R cos(c ot + 0) + y R sin (uot + 0) (4) 

This should be familiar. We saw it earlier in the course in another context when we were discussing 
the connection between uniform circular motion and complex exponential. This motion in the x-y 
plane is precisely the same as the motion of the complex exponential 

R e lujt ( 5 ) 


in the complex plane. 

The velocity of the particle is the derivative 

v (t ) = r (t ) = — x Ru sin(c ot + 0) + y Ru cos(c ot + 0) (6) 

Note that because sin 2 6 + cos 2 9 = 1, the length of v (t) is Ruj, so sure enough, the speed of the 
particle is constant, 

v — \v (f)| = Ruj , (7) 

even though the velocity vector is constantly changing. 
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The relation between (3) and (6) is illustrated below at time t (for to > 0): 


y 


( 8 ) 


Note that v (t ) is always perpendicular to f (t ) and still in the x-y plane (you can see this from the 
diagram, but you can also see it by explicitly computing the dot product, r (t) ■ v (t) and showing 
that it is zero for all t). 1 

We can compute the acceleration of the particle by differentiating (6 ). 

a(t) — v ( t) = —x Ru 2 cos(u )t + 0) — y Ru 2 sin(o;f + 4>) = — u 2 r(t ). (9) 

The accleration in (9) is always directed to the center of the circle and the magnitude of the accel¬ 
eration is u 2 R, or, using (7 ), this gives the (I hope familiar) formula 

v 2 

a = \a (t) ( = uj 2 R = — (10) 

it 

'in fact, r (t) ■ v (t) = 0 for any motion on a circle, centered at the origin, whether it is uniform or not. You can 
see this by differentiating r (t) ■ r (t) = R 2 . 
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Here is (8) again with the acceleration shown this time. 



( 11 ) 


The moral of this is much more general than it appears. Note that for uniform circular motion, as 
we go from r to v = r to a = v = r ., each time we turn the vector counterclockwise by 90° and 
multiply by uj. We have thought about this in the context of a position vector. But any other vector 
(as we have already seen with v) that undergoes uniform circular motion behaves the same way. 
We can get the time derivative by turning counterclockwise by 90° and multiplying by u. This is 
telling us something very deep and important not just uniform circular motion, but about rotations 
and motion in general. We will begin to explore it in more generality in the next lecture. 

Here is a simple application that is related to something that we know in various other ways. 
Suppose that we have a force of the form 


F(r) 


-a rP r 


( 12 ) 


directed toward the origin with magnitude r@. For what value of 6 do the circular orbits of a mass 
subject to this force have the same angular frequency independent of r? We want 

F(r) =—ar^ r = ma =—into 2 r f (13) 


so 


a 

771 


2 8-1 
uj — — r 


(14) 


and if f3 = 1 the angular frequency is independent of r. This is just a harmonic oscillator in more 
than one dimension. 


Spinning relativistic string 

Next time, we are going to start talking more seriously about the concept of angular momentum. 
We will find that it arises from the general principle of rotation invariance, and as such applies just 


4 








as well to relativistic systems as in nonrelativistic mechanics. Indeed, one of the ways we know 
that relativistic string exists is by studying systems of quarks and antiquark that carry angular 
momentum. Let us study such a system. Consider a system of a massless quark and antiquark 
connected by relativistic string with tension T in its rest frame. Let us determine the conditions 
under which this system can rotate with the string straight and the quark and antiquark moving in 
a circle of radius i as illustrated below: 


v — 1 

(15) 



The first thing to note is that the quark and antiquark must move at the speed of light because 
they are massless, by assumption. This has three immediate consequences. One is that the angular 
velocity of the system is (n relativistic units) 

w = j (16) 

The second is that the quark and antiquark can carry only negligible momentum. This is be¬ 
cause the force from the string at the end is Lorentz transformed to zero. You saw in Morin’s 
book that the force from a string moving in the transverse direction with velocity v is reduced to 
Ty/1 — v 2 . This effect is simply time dilation. The transverse components of momentum do not 
change under a Lorentz transformation. Because we see moving clocks tick slowly by a factor of 
l/y, we also see the rate of change of transverse momentum reduced by a factor of I/ 7 . In this 
case, where the quark and antiquark at the end are massless and move at the speed of light, 1/7 = 0 
and thus the rate of change of the momentum of the quark and antiquark must vanish. But the rate 
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of change of momentum of a particle with momentum of magnitude p in uniform circular motion 
is pu. Because u ^ 0, we must have p — 0. Thus all the momentum and angular momentum in 
this system is carried by the string. 

The third consequence is that at a point on the string a distance r from the center is moving 
with velocity r/i, as shown in (15 ). 

Next, let us check that the system can hold itself together. The force from the string at some 
radius r 0 must be right to produce the appropriate rate of change of momentum, which is the change 
of momentum associated with the uniform circular motion of the part of the string for r > r 0 . To 
see that this works, let’s calculate the momentum carried by the portion of the string for r > r 0 . 
The contribution from an infinitesimal bit of string between r and r + dr is 

dm T dr r 


\/l - v 2 sjl - r 2 /f 

Then the momentum carried by the string for r > ro is 

T dr r 


2 £ 


(17) 


Jr 0 y/f - r 2 j £ 2 £ 

If we change variables to £ = r ft, this becomes 

ZdZ -T£ f 1 dz 4- = TiJl- r 2 /£ 2 

Jrn/e dz v 


(18) 


Ti 


Iro/i \/l — z 2 

The rate of change of momentum is then 

pu = p/£ = T yjl — rl/£ 2 


(19) 


( 20 ) 


This is right, because it reproduces the string force at r 0 

Now that we know how this object moves and that it hangs together, we can ask what its energy 
and angular momentum are (it is at rest, so the total 3-momentum is zero). The energy from a small 
bit of string is just like (17) but without the factor of r/£ at the end. The total energy is then 

T dr „ „ r 1 dz 


dr 


me 




= Ti 


(-1 vT 


This should be a familiar integral — we can make the trigonometric substitution 

z = sin 9 dz = dO cos 0 

to write it as 


E = Ti 


nn/2 
/—7T / 2 


de = 7iT£ 


( 21 ) 


( 22 ) 


(23) 


To compute the angular momentum, we need an additional factor of rv = r 2 /£ in the integrand 
of (21): 


dr 


T r 2 dr 


-i isj 1 - r 2 f £ s 


= Ti 2 


r 1 z dz 
l-i \Jl — z 


(24) 
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The same trigonometric substitution allows us to write this as 

L = Tf r de sin2 0 = -kTI 2 (25) 

J-n/2 2 

Now the point is that we can eliminate £ from (23) and (25) and write this as a relation between the 
energy and angular momentum 

E 2 = t t 2 T 2 £ 2 = 2 vrT L (26) 

Sure enough, when we look at quark-antiquark bound states, we see families of particles with sim¬ 
ilar properties in which E 2 grows linearly with L within each family. 2 Here is some representative 
data: 



The data shows a slope of about 


2nT « 


1 GeV 2 

h 


(27) 


From this we can extract an approximate value for T, which works out to about 15 tons! These 
little particles are pulled by a rather strong force! That’s why quarks and antiquarks don’t get very 
far apart! 


2 See for example http://arxiv.org/abs/hep-ph/0103274 
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Relativistic traffic 


Consider relativistic traffic on a Boston street. We will make the completely unrealistic assumption 
that the traffic is flowing smoothly, with evenly spaced cars moving in opposite directions at speed 
v as shown below. 


V \ V \ «— v V \ V \ V \ V \ V \ V 


I V —> ] | V —> | | V —> ] | V —> | | V —> | | V —> | | V —> | | V —> | | 


(28) 

You will notice that there is something else that is unrealistic about this picture. There are plentiful 
parking spaces on the side of the road. This picture is animated in the Mathematica program 

traffic.nb. 

Now let us look at this picture from the inertial frame of a driver in the lower lane of traffic. In 
the animation, you can change the inertial frame gradually. Try increasing the speed of the frame 
gradually, until you to get to an inertial frame that is moving along with the lower cars. Of course, 
as you increase the speed of the inertial frame, the parking spaces appear to move in the opposite 
direction, and the cars in the other lane move by faster. But here, we also see a number of effects 
of Lorentz contraction. The animation assumes that the initial speed of the cars is 0.9c, so that 
things are very relativistic and Lorentz contraction is a big effect (and likewise the cars moving 
in the opposite direction don’t move much faster because they were already moving at close to 
c in the original frame. I have done this to make the important effects show up better. In the 
actual physical demo you will see, the effect is much smaller, but the qualitative message is the 
same. Notice that as the lower cars slow down in the moving frame, they also appear longer. This 
is because they appeared Lorentz contracted in the initial frame, and going to the moving frame 
undoes some of the effect. The parking spaces and the upper cars going in the opposite direction 
are Lorentz contracted more and more as the speed of the frame increases. Finally, we get to the 
frame in which the lower cars are at rest, which looks sort of like this: 
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v = 0 


v = 0 


v = 0 


(29) 

There are several things to notice about this. One is you see all the other cars in your lane farther 
apart than they were in the frame of the road, and they don’t seem to fit into the parking spaces 
anymore. This is just Lorentz contraction. This is the same thing that is going on the problem of 
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the relativistic chocolate chip cookies in David Morin’s book. There you remember that the cookie 
dough was moving at relativistic speed on a conveyor belt and a circular cookie press was stamping 
out cookies - but the because the dough is Lorentz contracted, the cookies come out elongated. 
The cars going in the other direction are Lorentz contracted even more, and look shorter and closer 
together. 

Something may be bothering you about this. How can the density of parking spaces change 
relative to the density of the cars? Where do those extra parking spaces come from? If we can 
harness this, we will be rich beyond the dreams of avarice. In fact, we can’t make any money on 
it, because these extra parking spaces are coming in from the past and the future. This is possible 
everywhere if we have an infinite line of them stretching out in both directions, but it is always 
possible in some finite region of the roadway. The density in the rest frame is not the same as the 
density in a moving frame because of the relativity of simultaneity (which is the culprit in most 
such puzzles). But again, the effect is even larger for the cars going in the opposite direction. 
Because they are moving faster, they are even more Lorentz contracted than the parking spaces. 

Of course, I don’t really have any relativistic cars available to test these theories. But there 
is a very interesting analogy. Instead of two opposing lanes of traffic, consider current in a wire 
flowing up and back from a battery. 



In this analogy, the negatively charged electrons are the cars and the rest of the electrical structure 
of the wire form positively charged parking spaces. Of course the actual motion is more chaotic — 
more like real Boston traffic. But the net current is as shown. The whole system looks electrically 
neutral because there is one car for each parking space. Now the question is, what forces do the 
components of one of the wires feel do to the moving electrons in the other. We expect from the lab 
frame where everything is neutral that there is no force on the fixed positive matrix - the parking 
spaces. And since there is no force in the lab frame, there is no force in any frame. But what force 
do the electrons feel. Now it is not so obvious, because the electrons are moving in the lab frame. 
So even though everything is neutral, there might still be a force that depends on velocity. Thus 
it is simplest to think about it in the inertial frame moving along with the electrons in one of the 
wires. In this frame, any force on the electrons proportional to velocity goes away, because they 
are at rest. But in this frame, as the animation or (29) shows, the electrons in the lower wire see 
far more negatively charge electrons in the upper wire than they see positively charged parking 
spaces for the electrons. Thus they experience a repulsive force. In the original frame, there is no 
Coulomb force because everything is neutral. In this frame, the repulsive force is still there, but it 
is interpreted as magnetism. When I connect the battery and get my two lanes of traffic moving, 
there should be a repulsive force between them. 

Thus as you will see in more detail in Physics 15b, magnetism is a relativistic effect. When 
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you feel the mysterious force between two magnets, that I am sure has fascinated all of you at one 
time or another, you are feeling in your bones the effect of special relativity. 
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lecture 16 

Topics: 

Where are we now? 

Cross products - introduction 
More general motion about an axis 
Small rotations 
The reference point 

Rigid bodies rotating about a fixed axis 

The three dumbbells 

Appendix - cross products - details 

Where are we now? 

Now we are going to change gears and talk about angular momentum. We will see that conservation 
of angular momentum is associated with the invariance of the laws of physics under rotations. In 
that sense, the subject of angular momentum is not so different from relativity. Both are related to 
symmetries of the laws of physics. 


Cross products - introduction 


Last time, we reviewed uniform circular motion. To get a deeper understanding of circular motion 
in three dimensions, and to ready ourselves to study the more general problem of rigid body ro¬ 
tations, we will reformulate the discussion of uniform circular motion in terms of cross products. 
The advantage of this reformulation is that it makes the vector structure of the rotations more ex¬ 
plicit. This is what we will need to take the next steps. Cross products are going to be particularly 
convenient as we talk about rotations and angular momentum. 
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Look at (1), from last time. As we go from the position, f, to the velocity, v, to the acceleration, 
a, each time I just rotate the vector by 90° and change the length by a factor. We can rewrite this 
in a very useful way using cross products. The cross product can be defined by 


AxB 


— AyB Z — A Z By , 


AxB 




— A z B x — A X B Z , 


AxB 


— A X By — AyB X . (2) 


This looks complicated (I hope that it will seem simpler after you have read the discussion in the 
appendix). The key point is that the formula for the velocity can be written as 


— r (t) = r(t) — uj z x r (t) (3) 

at 

because r (t) satisfies 

r (t) ■ z = 0 . (4) 

The zx produces the 90° twist, and the factor of u rescales the vector by a factor (necessary in this 
case just to get the dimensions right). 

You can see from the explicit definition in the appendix the following property: 


If a is in the x-y plane, then zx a is also in the plane, it has the same magnitude as (5) 

a, and its direction is that of a rotated by 90° counterclockwise about the z axis. 


This is just the right-hand rule. If a is not in the x-y plane (as you can see in more detail in the 
appendix) then the cross product just throws away the component that is not in the plane and acts 
in the usual way according to (5) of the component in the place. 

The relation (3) is very general. Not only is the time derivative given by this cross product for 
the position vector, but any vector undergoing uniform circular motion in the x-y plane behaves 
the same way. 

For any vector u (t) undergoing uniform circular motion in the x-y plane, the time (6) 
derivative is given by the cross product 

—u (t) — UJZ X u(t). 


In particular, this gives us another way of calculating the acceleration. The velocity vector is also 
undergoing uniform circular motion. Thus 

a (t) — u z x v (t) — u z x (u z x r(t)') = —u> 2 r (t) (7) 

Where the last step follows because each cross product with z just rotates r by 90°, so the two 
cross products give a rotation by 180°, which is just a sign change. 

One of the crucial things about describing uniform circular motion in this way with cross 
products is that (3), (4) and (7) are vector equations. The z appears in (3), (4) and (7) because 
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rotation in the x-y plane is rotation about the z axis. But because these are vector equations, we 
can rotate them to an arbitrary coordinate system. Then we can describe uniform circular motion 
in any plane through the origin, about an arbitrary axis through the origin. If the rotation is in some 
other plane through the origin, say perpendicular to a unit vector n (it is then rotation about the 
axis, h), then the rotated versions of (3), (4) and (7) are 


v (t) = — r (t) = uj n x r(t), 


( 8 ) 


r (t) ■ h = 0 


( 9 ) 


and 


a (t) 


— v (t) — uj n x (u; n x r(t)j = —uj 2 r (t ), 


( 10 ) 


where the last step follows because the cross product with n on a vector perpendicular to it just 
produces a rotation of 90° in the plane perpendicular to n. Remember that (8 ) works only for 
motion that is circular! But still as we will see this is very powerful. 


More general motion about an axis 


Let’s go back to circular motion about the 2 axis and think briefly about what happens if the vector 
r is not in the x-y plane. In this case, uniform circular motion about the z axis looks something 
like this: 



(ID 


The velocity is in the x-y plane. What is going on is that the component of r perpendicular to z is 
undergoing uniform circular motion in the x-y plane while nothing is happening to the component 
of r in the z direction. This is very elegantly encoded in the cross product formula, (3) if we just 
drop the condition (4) that r be perpendicular to z and thus in the x-y plane. Thus 


v (t) = uo z x r{t) 


( 12 ) 


is more general than we claimed above. Furthermore, there is really no reason to restrict ourselves 
to uniform circular motion. If at some time t, the vector r is instantaneously rotating about the z 
axis with angular velocity c j(t), the velocity is 

(IT 

-jjt= v(t) =u(t)z x r (t) (13) 
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Finally, again because (13 ) is a vector equation, we can describe instantaneous rotation about 
an arbitrary axis h by just replacing 3 —> h: 


r 



(14) 


Thus we can make the following very general and very important statement: 

If at some time t, the vector r is instantaneously rotating about the n axis through (15) 
the origin with angular velocity u(t), the velocity is 

dv 

— = v (t) — u(t) h x r(t) 


This is a very general connection between a rotation and a cross product. Eventually, we will 
discuss what happens if the rotation is about an axis that does not go through the origin - but this 
is enough for now. We will use (15) very often so make sure that you understand it. 

Small rotations 

It will also be useful to rewrite (15)as an expression for the instantaneous change in a vector under 
an infinitesimal rotation, which we can do by noting that 

, , dd 

= j t (16) 

so that multiplying both sides of (15) by dt gives dr, the infinitesimal change in the vector f under 
an infinitesimal rotation dd, 

df = dd n x f (17) 

One reason for thinking about infinitesimal rotations is that we know that infinitesimal symme¬ 
try transformations lead to conserved quantities by Noether’s theorem. For infinitesimal rotations, 
the corresponding conserved quantity is a component of the angular momentum. 

For a 3-dimensional system described by some number of 3-vector coordinates r 3 , the con¬ 
served quantity associated with invariance under rotations about an axis n is 

L ■ h (18) 

where 

L = Pi ( 19 ) 

3 
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where pj is the momentum of the jth particle, 



dC 


( 20 ) 


If the Lagrangian is invariant under all rotations, all components, and therefore the entire vector, L 
is conserved — L is the angular momentum — this is angular momentum conservation. 


What I think is worth understanding about this is the way that the appearance of the cross 
product in rotations leads to the appearance of the cross product in the expression for the angular 
momentum. This may have seemed mysterious to you, even as you followed the derivations in 
previous physics courses showing that angular momentum is conserved under certain situations. 
What is really going on is conservation of angular momentum appears naturally from Noether’s 
theorem in a Lagrangion that is invariant under rotations. The mysterious cross product comes 
simply from the mathematical description of infinitesimal rotations in 3-dimensional space. 

The reference point 

So far we have found the velocity of a vector rotating around an axis that goes through the origin 
of the coordinate system. We used this to find the change in a vector due to a small rotation around 
an axis through the origin and from this we derived the form of the conserved angular momentum 
that follows from invariance under such a rotation. I now want to remove the restriction that the 
axis goes through the origin, so that we know what happens for a totally arbitrary rotation. This is 
not difficult. If the axis goes through some arbitrary point R and the axis points in the direction fi, 
the velocity of the point fj is given by 



( 21 ) 


This must be right, because subtracting R from every vector in the original coordinate system does 
not affect the velocities (because R is fixed) and takes us to a coordinate system in which R is the 


origin, in which (21) is equivalent the result we derived above. Notice that because n x h = 0, we 


can add to R any multiple of n without affecting (21): 


ton x (fj — R) = ton x (fj — R — an) 


( 22 ) 


Following the same argument that we gave above, we see that if a Lagrangian C(f, f) is invari¬ 
ant under rotations about an axis through the point R in the n direction, then Noether’s theorem 
implies that there is a conserved quantity of the form 



(23) 


and if C depends on several vectors, the conserved quantity is 


n-L A = n-^2(fj-R) x Pj 


(24) 
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If the Lagrangian is invariant not just under rotations about the axis n through R, but under 
rotation about any axis through R, then the entire angular momentum vector is conserved, 

L n = (f-R)xp (25) 

and if C depends on several vectors, the conserved quantity is 

Ln = '52( ff j-R)x-Pj (26) 

3 

In this case, the point R is called the “reference point” and we talk about L as “the angular 
momentum about the point /?,.” In this case, there is no ambigiuty in 11 (that is there is no analog 
of (22)). In (26) (and more trivially in (25)), you see that the L f ; is independent of R if the total 
momentum vanishes. 

Rigid bodies rotating about a fixed axis 

Now that we know from our study of relativity that rigid bodies don’t exist, we are going to study 
them in detail. Of course, this is not quite as stupid as it sounds. What we mean by a hypothetical 
rigid body is one in which the shape is completely fixed. Mathematically, we can describe this 
by saying all the lengths of vectors between different parts of the system are fixed by the internal 
dynamics of the system. We know that this can only be an approximation. Whatever the internal 
dynamics is that keeps the system rigid, it will always be possible to deform the system slightly. 
But it can be a very good approximation if we are concerned only with relatively slow motions of 
the whole system. What we really mean when we say that a body is rigid is that all the modes that 
correspond to deformations of the system have very large frequencies, and/or very large damping, 
so that we can ignore them. 

The most general continuous motions that preserve the shape of a rigid body are combinations 
of translations and rotations. This crucial fact is proved in David Morin’s book, but I hope that it is 
obvious to you. In fact, we can always describe the position of a rigid body by giving the position 
of any convenient point on the body, and specifying the rotation required to get to its orientation 
from some standard orientation. For bodies that rotate only about a single axis, the situation is 
simpler, because we can specify the rotation by just giving the angle 6 of the system from some 
fixed position. Thus a rigid body rotating about a fixed axis is a system of one degree of freedom. 

Suppose that the fixed axis is in the n direction, and that the axis goes through the origin of 
our coordinate system. Then when the body rotates, every part of the rigid body executes circular 
motion about some point on the axis, and every part moves with the same angular velocity. The 
radius of the circular motion is the distance from the axis. This is illustrated below for a square 
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made of balls and sticks: 


( 27 ) 


The kinetic energy of such a rigid body is given by the sum of \mv 2 for each of the parts of 
the system. 1 We are going to compute the kinetic energy in terms of the angular velocity, 9, and 
discuss it. I will do this for an arbitrary axis h, because I want you to get used to dealing with 
arbitrary axes. But because I also want you to understand what I am talking about, I will repeat 
some of the steps for the special case of rotations about the z axis — that is fi = z. 

Suppose that the system is made up of point masses with positions at some time t given by the 
vectors fj = (x 3 , y 3 , z 3 ), and that the angular velocity of the system about n at time t is 0. Then 
the velocity of the point with position fy is 

9 n x fj (28) 

It is perpendicular to n and f 3 because of the form of the cross product. For n = z, this looks like 

6 z x f 3 = 6 (—ijj , Xj, 0) (29) 

— the velocity is in the x-y plane, perpendicular to 

Thus the kinetic energy for rotations about the fi axis has the form 

^ Y. 77ij 9 2 \n x fj\ 2 = ^ J0 2 (30) 

where 

/ = m 3 |n x fj\ 2 (31) 

3 

For n = z, this looks like 

1 = Y, m 3 \z X rj\ 2 = J2 m 3 (■ x ) + Vj) (32) 

3 3 

The quantity / in (30) and (31) is called the “moment of inertia” about the axis n. As you see, 
it summarizes the behavior of the body under rotations about the axis n. The length 

|n x fj\ (33) 

'if the system is continuous, we must replace the sum by an integral, but this doesn’t change anything important. 
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(as you can see immediately for n = z) is just the perpendicular distance of the jth point mass 
from the axis n. This remains constant as the body rotates, so we do not have to specify the time 
dependence of this, and / is independent of time. It is a constant that just depends on the axis and 
the masses in the rigid body and their positions. 

The Lagrangian describing the motion of this system will involve the kinetic energy, (30). If 
we differentiate this with respect to 9, we get the generalized momentum associated with the angle 
9 that specifies the orientation of the rigid body. This is 

L = I9 (34) 


For a rigid body rotating about the h axis, the “momentum” L in (34) is just the component of the 
angular momentum about the axis ft. Let’s prove this. In general, the angular momentum is 

L = ^2 fj x pj = ^2 rrij fj x fj — rrij fjX9 (h x fj) (35) 

j j o 

so the component in the n direction is 


n ■ L = h • ^ rrij rj x 9 (n x fj) = 9 

3 3 

= 9 m 3 (n x L/) • {n x fj) = 9 

j j 


ntj n ■ (fj x (n x fj)) 
rrij \n X fj\ 2 — 19 


(36) 


At the end of (36), we have used the cyclic property of the triple product (a ■ (b x c) = c ■ (a x b) 
— an important fact that keeps coming up in the physics of rotations). The point is that if the 
axis is fixed, it is only the component of angular momentum about the axis that matters. Whatever 
is fixing the axis can provide torques that change the other components of angular momentum 
anyway, so they are not very interesting. But the component in the direction of the fixed axis will 
be conserved unless there is some physics that depends on the angle 9, or some other degree of 
freedom that interacts with the rigid body. 

Just to say this once more in a different way, if the rigid body is free to rotate about the fixed 
axis n, then the Lagrangian is just (30) and (34) is the quantity that is conserved by virtue of 
Noether’s theorem, or equivalently the fact that the Lagrangian is independent of 9. It is clearly 
related to the angular momentum, but it is not the angular momentum — just one component of 
the angular momentum. We will see later that in general the other components are not conserved in 
this ki nd of motion because the physics that is fixing the axis is supplying a torque. These relations 
are very important. We will come back to them next week in a more general context. 


The three dumbbells 

A very simple system that shows some of the bewildering complexity of rotational motion is a 
system of two equal masses m attached to one another by a string and free to rotate without friction 
on a horizontal plane about their common center of mass. The configuration of this system for fixed 
/(just depends on the angle with angular velocity u = 9. 
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If the distance between the objects is 2R, so that both masses are a distance R from the center 
of mass, the angular momentum is perpendicular to the plane of the rotation and equal to 

L — z ■ L — z ■ (f x p^j = 2mR 2 u> (37) 

so in this case the moment of inertia is 

I = 2 m R 2 (38) 

Now suppose the distance between the masses is variable - R(t ') - where the t dependence is 
imposed by pulling in or letting out the string. It is probably obvious that angular momentum is 
conserved in this case, but let’s be pedantic about it and think about the Lagrangian. It is just the 
kinetic energy 

C = mR{tfd 2 + mR(t) 2 = h 6 2 + mR(t) 2 (39) 

Sure enough, the Lagrangian does not depend on 6 , so the momentum associated with 9 is con¬ 
served - and this is just L 

BC 

L = —- = 2 mR{t) 2 9 = 2mR(t) 2 u (40) 

B6 

So if, for example, R{t) is decreasing, lu must be increasing to keep the angular momentum con¬ 
stant. 

This is very familiar. This is the way an ice-skater gets spinning very fast. But it is easy to get 
confused. When you pull on the rope, you are pulling perpendicular to the direction of motion of 
the masses. How is it that this speeds up the masses? In this simple case, it is easy to see how 
everything works. While the rope is being pulled in, there is a component of the motion of the 
mass in the direction of the centripetal force, and thus the centripetal force feeds energy into the 
kinetic energy of the masses, increasing their speed. But still this simple example is a taste of some 
of the bizarre behavior of angular momentum in three dimensions. We will see a lot more of it in 
the next few weeks. 

A simple way to implement this system is to hold a weight in each hand and stand on a fric¬ 
tionless turntable. 

Appendix 1 - cross products - details 

The cross product is essentially just an antisymmetric combination of two vectors. This anti¬ 
symmetric combination of two vectors in interesting because it defines a plane, and planes are 
intimately connected with rotations. The particularly convenient thing about this combination in 
three dimensional space is that it behaves like another vector. The cross product is the mathe¬ 
matical statement of the fact the antisymmetric combination of two vectors in three dimensional 
space defines a plane which in turn defines another vector. The geometrical definition of the cross 
product is a good way to see that it behaves like a vector under rotations, so we will start with that. 
Then I will indicate how we can show that this geometrical definition is equivalent to a definition 
given in terms of components. 
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The geometrical definition is this: 


Given two vectors, A and B, the object AxB is a vector with magnitude (41) 


A 


B 


sin 0 where 9 is the angle between A and B defined as a positive angle 
between 0 and n. The direction of /I x /i is perpendicular to the plane formed by A 
and B with the sign determined by the right-hand rule. 


With this definition, it is easy to understand why AxB behaves like a vector under rotations. The 

and sin 6 are all unchanged. And the 


A 


B 


magnitude doesn’t change under a rotation because 
direction rotates properly because it is tied to the directions of A and B. 

It is crucial that the cross product AxB is antisymmetric in the two vectors A and B, 


A x B = -B x A 


(42) 


In the geometrical definition, this follows from the application of the right hand rule. If you 
interchange A and B, the cross product changes direction because the right hand rule goes from 
B to A rather than from A to B. This antisymmetry ensures that either the two vectors A and B 
define a plane or the antisymmetric combination vanishes. Then the fact that in three dimensional 
space, there is a unique direction perpendicular to a given plane allows us to turn the antisymmetric 
combination into a vector. 

The geometrical definition, (41), is equivalent to the following component definition, 


AxB 


— AyB Z — A Z By , 


AxB 




— A z B x — A X B Z , 


AxB 


A X By AyB X 


(43) 


where we are using a notation for vector components in which 


A 


X 


A, 



A. 


y > 



(44) 


If you have not seen cross products before in your math courses, you should look carefully at the 
demonstration of this equivalence below. We will be using cross products a lot for the next couple 
of months, so you might as well get used to them. 

To prove that (41) and (43) are equivalent, we first show that 


(AxB) ■A= ( AxB ) -B = 0 


(45) 


We can do this by explicit calculation. For example, 

(A xfij ■ A = (A y B z — A z B y )A x + ( A Z B X — A x B z )A y + ( A x B y — A y B x )A z = 0 (46) 


The calculation for B is similar (as it must be because of the antisymmetry of the cross product 
- (46) just says that the dot product of the cross product with the first vector in the cross product 
vanishes - and because of antisymmetry the same must be true for the second vector in the cross 
product). Thus AxB is perpendicular to both A and B and therefore perpendicular to the plane 
they form, just as in the geometrical definition. 
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You can see that the magnitude of the object given by (43) is right by explicitly calculating its 
square. 


x nj ■ (A xB) - ( A y B z — A z B y ) 2 + ( A Z B X — A X B Z ) 2 + ( A x B y — A y B x ) 2 (47) 

= By + A 2 x B 2 z + A 2 B 2 + A 2 B 2 + A 2 B 2 + A 2 B 2 — 2A X B X A y B y — 2A X B X A Z B Z — 2A y B y A Z B Z 

(48) 

If we add and subtract A X B X + A 2 B 2 + A 2 Z B 2 Z to this, the positive terms can be factored into 

(Al + Al + At)(Bl + Bl + Bfj (49) 

and the negative terms into 

— ( A x B x 4- A y By + A z B^j (50) 

so we can write (47 ) as 


(Ax b) 

■(AxB) = 

A 

B 

2 / -> 
-(A-B) = 

A 

B 


'(1 


cos 


d) = 


A 


B sin 2 6 (51) 


Finally, you can see the right-hand rule by calculating an example, like x x y = z. Thus we have 
checked that the component definition (43) is equivalent to the geometrical definition (41). 

We can use the cross product very efficiently to describe uniform circular motion about the 
origin. First notice that the velocity 


v ( t ) = r(t) 


—x Ru sin(o;f + 0) + y Ru cos(ut + 0) 


(52) 


can be written as 
where r (t) satisfies 
Let’s see this explicitly: 


v (£) = uj z x r (t) 
r (t) ■ z = 0 . 


(53) 

(54) 


v (t) 
v(t) 


v[zx r (t)] x = a; ([z\ y [r{t)] z 
a }[z x r(t)] y = uj ([5].[r (f)]^ 
uj[z x r(t)\ z = uj {[z] x [r (t)] y 


[i] z [f ( i )] j ,) = -uj [r(t)] y = —uRsm(ujt + 0) 
[z\x[r (t)] z ) = uj [r(t)] x = ujRcos(ujt + 0) 

[z]y[r{t)\ x ) = 0 

(55) 


(4) is the statement that r (t) is in the x-y plane and (3) describes the effect of the circular motion. 
As we saw, what the cross product of the unit vector z is doing on the vector r in the x-y plane is 
just rotating the vector counterclockwise by 90°. 

Another useful quantity is the so-called “triple product” 

[A x B^j ■ C = A x B y C z — A y B x C z + A y B z C x — A z B y C x + A z B x C y — A x B z C y . (56) 


You can see explicitly that this has the property of complete antisymmetry. It changes sign if any 
two of the vectors are interchanged. This means also that the triple product is “cyclic” — 

(Ax b)-C = (b xc) ■ A= (c x A) ■ B (57) 


This is a good thing to remember. 
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lecture 17 


Topics: 

Where are we now 
Central forces 

Energy and angular momentum 

V (oo) = 0 

The parallel axis theorem 

Example - Planar bodies moving in a plane 

Torque - fixed reference point 

Example — oscillations of a hanging rod 

Torque - moving reference point 

Example — Rolling ring 

Where are we now 

Rotations are important for two reasons. The highfalutin theoretical reason is that the laws of 
physics are apparently invariant under rotations. This leads to the important conservation law 
of angular momentum. The down-to-earth practical reason is that things rotate — and when rigid 
objects rotate, things get complicated and understanding the structure of rotational motion is crucial 
to understanding what happens. 

In the last lecture, we began by reviewing uniform circular motion. We used the connection 
between uniform circular motion and the cross product to find the velocity of a vector rotating 
around an axis through some arbitary point R. 

If at some time t, the vector f is instantaneously rotating about the n axis through (1) 
some arbitary point R with angular velocity u(t), the velocity is 

^ = v (t) — u>(t) h x (r (t) — It') 


We used this to motivate the definition of angular momentum from Noether’s theorem, and to 
define the moment of inertia of an object rotating about a fixed axis. 

Today, after a quick review of central forces and the effective potential, we are going to go on 
with our dicussion of moments of intertia about fixed axes, talk about the parallel axis theorem and 
discuss torque. 

Central forces 

One of the most important examples of gravity in action is the solar system, where the sun is much 
more massive than all the other bodies, so the center of mass is essentially at the position on the 
sun. Then to a good approximation, all the other planets orbit the sun. The gravitational forces of 
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the planets on each other are very small compared to the force of the sun and can be ignored to 
first approximation. This means that we can put the sun at the origin and look at the orbit of one 
planet at a time. This in turn is an example of a more general system with a central potential, with 
Lagrangian 

1 • 9 

( 2 ) 


1 2+2 

C(r,r) — -mr —V{r) 


where 


r = 


(3) 


This is invariant under rotations about any axis through the origin so there is a conserved angular 
momentum, 

L — fxp — mfxf— mfxv (4) 

While we know formally that the angular momentum is conserved because of Noether’s theorem, 
it is instructive to see how the conservation arises in this particular case. Using the product rule, 
we can write 

d - d ■ ■ 

(5) 


d -f d ^ ^ ^ ^ ^ u, 

— L = m — r x r = mr x r + mr x r 
dt dt 


The first term vanishes trivially because A x A = 0 for any vector A because of the antisymmetry 
of the cross product. The second term vanishes because 


_ Q Qr 1 

mr = F = --—V (r) = — — V'ir) = — fV'(r) = —r V'(r) 
or or r 

so the second term is proportional to r x r — 0. The crucial step here is 

dr 1 _ 
dr r 


( 6 ) 


(7) 


We have talked about this relation before, but it is so important that I want to give you several ways 
of remembering it. This can be seen in various ways. One of the nicest is to differentiate both sides 
of the equation r 2 = r ■ r, 

d „ dr d 

( 8 ) 


d 2 o dr 9 _ _ 

— r = 2r — = —r ■ r = 2r 
dr dr dr 


from which (7) follows immediately. Another way of seeing this handy fact is to think about what 
r looks like considered as a function of the components of r. This is easier to visualize in two 
dimensions, where we can plot r = \Jx 2 + y 2 versus x and y, and the result is a cone standing on 


2 





its tip: 


r 



with the y axis going into the paper. Now is the gradiant operator V that points in the direction 
in which the function it acts on is increasing, and has magnitude equal to the rate of increase. But 
the r function described by the cone increases linearly as one goes away from the origin in every 
direction, which is another way of seeing (7 ). 

Energy and angular momentum 

One of the most important consequences of angular momentum conservation for motion in a central 
force is that the motion is confined to a plane. This follows because L = r x p = mr x v, it is 
perpendicular to both r and v: 

r ■ L = v ■ L = 0 (10) 

Therefore both the position vector and the velocity are always in the plane through the origin 
perpendicular to L. 

It is therefore very convenient to analyze the motion in polar coordinates in the plane of the 
motion. Let us rotate our coordinate system until the motion is in the x-y plane. The angular 
momentum is then in the 0 direction, and we can define our polar coordinates by 

x = rcos9 y = rsin6 (11) 

In this coordinate system, the problem of finding the motion reduces to an equivalent problem 
in two dimensions. Because we know from the outset that z is going to be zero throughout the 
motion, we can ignore z completely and describe the system in terms of r and 6. 
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The square of the speed looks simple in both cartesian and in polar coordinates because 


• 2 i -2 

x + y 


•2 , 2 n2 

r + r 9 


( 12 ) 


which you can see either directly by differentiating (11) or by staring at the picture below and 
noting that the component of the velocity in the radial direction is perpendicular to the component 
of the velocity in the angular direction. 



The Lagrangian for this equivalent two dimensional problem is 


£(r, 9, r, 6) 


-mr 2 
2 


_l — mr 2 6 2 
2 


V (r) 


The Lagrange equation for r is 


or 


d^dC _ dC 
dt dr dr 

mr = mr 6 2 — V' (r) 


The Lagrange equation for 9 is 

ddC _dC 

dt d9 d9 
or 

d o • 

— mr~ 9 = 0 
dt 

This expresses conservation of angular momentum in polar coordinates, because 


(13) 

(14) 

(15) 

(16) 
(17) 


mr 2 9 = mr vg = [mr x v\ z 


Thus 


mr 2 9 = L 


where L is the nonzero z component of L = L z. 
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L 

mr 2 


(18) 

(19) 
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So far, everything we have done is valid even if V (r) depends on time explicitly. But usually, 
we are interested in a 1/(r) that is time independent, in which case there is a conserved energy, 

E = —mr 2 + ^ m r 2 9 2 + V(r) (20) 

Because the angular momentum, L, is constant, we know 9 if we know r and L. Thus we can now 
eliminate 9 from the energy and write 

1 i l 2 

E — -mr 2 + -- ~+V{r) (21) 

2 2mr z 

This is a very important equation. As with most important equations, I hope that you will not so 
much memorize it as understand the logic so well that you can reproduce it instantly whenever you 
need it. In this case, the logic is conservation of energy and angular momentum. Among other 
things, (21) implies that a particle with nonzero angular momentum can never get to the origin, 
unless the potential goes to minus infinity as fast as 1/r 2 . The |^r acts as a barrier (sometimes 
called the “angular momentum barrier” — clever, no?) that keeps it away. Thus for reasonable 
potentials, there is some radius of closest approach to the origin for any motion with nonzero L. 

V (oo) = 0 

We can learn a certain amount about what goes on for a particular central force law just from (21). 

Consider an attractive potential that goes to zero as r —► oo. For definiteness, lets consider a 
class of potentials 

V(r ) = —ar~P (22) 

for positive a and j3. This is nice because it includes the gravitational potential. For potentials of 
this form, (21) looks more of less like this: 
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The nature of the orbit depends on the energy. If E is positive, there is a point of closest 
approach, but the particle then goes out to infinity with nonzero kinetic energy. 



Basically, orbits with positive energy look pretty much the same so long as the potential is not 
really weird. But if E is negative, then things get more interesting. The particle cannot get out to 
r = oo so the orbiting mass must move back and forth between a minimum and maximum radius. 



But the details of what the orbit looks likes in two dimensions is very complicated in general. 
For the — 1/r potential, the orbit closes. But that is not true for (3^1. We can look at these 
numerically in central-force.exe. 

Note that except for (3 — 1, the point of closest approach to the origin (called the “perihelion” 
— the point farthest away is called the “aphelion”, though this term is almost never used) moves 
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or “precesses” as the mass orbits. The advance of the perihelion of mercury is a famous test of 
Einstein’s theory of general relativity. 

For the minimum possible energy, things get simple again. The minimum possible orbit for a 
given angular momentum corresponds to a circular orbit. 



The parallel axis theorem 

Our expressions for moment of inertia and angular momentum about an axis have so far assumed 
that the axis goes through the origin of the coordinate system. If instead, the axis goes through 
some arbitrary point R in the direction h, the velocity of the point r} is given by 

Oh x (fj — R) (26) 

and the moment of inertia about this axis is 

Ir = m j I™ x Co - R)\ 2 (27) 

j 

This is valid for any point R, but the result can be put into a particularly simple form if R is the 
center of mass, defined by 

= g j "'j 0 = g j m 3 O (28) 

Ej rnj M 

where M = E j fn ;j is the total mass, 

g" m,j Tj = M R (29) 

j 
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Expanding (27) gives 


Icm = 21 m j I nxfj 

3 

= 22 m j x fj 2 — 2 

3 


\ 2 — 2^2 rrij (n x fj) ■ (n x R) + "^2 m j \n x -R| 2 

x mj fj^j ^ • (n x R) + \ n x R\ 2 


= 22 m 3 1^ x 2\ 2 — 2 M(n x R) ■ (n x R) + M \n x R\ 2 

3 

= I — M\n x R\ 2 = I — M R 2 


( 30 ) 


where R is the distance of the center of mass from the original axis. We can rewrite (30) as the 
“parallel axis theorem”, 

I = Icm + MR 2 (31) 

which in words says that the moment of inertia about an arbitrary axis is the moment of inertia 
about an axis through the center of mass plus the moment of inertia of a point mass M a distance 
R from the axis. 


Example — Planar bodies moving in a plane 

A simple situation in which we can use similar ideas in a more general way is the motion of 
planar rigid bodies in the plane. This sounds like a special situation, but in fact, it includes a lot 
of interesting situations. The angular momentum in this situation is always perpendicular to the 
plane. The important point here is that the configuration of a planar rigid body moving in a plane 
can be specified by the angle 6 that gives the orientation of the rigid body in the plane, plus the 
position of the center of mass r, where f = (x, y) is a two dimensional vector in the plane. The 
kinetic energy of such a body is 

1 • 1-2 

-I0 2 -\—mf (32) 

2 2 

where m is the mass of the body and / is the moment of inertia of the body about an axis perpen¬ 
dicular to the plane through its center of mass. If there are several such objects, the kinetic energy 
will be a sum, and the interactions will typically conserve momentum in the plane and angular 
momentum perpendicular to the plane. 

Here is a simple example. Suppose a point mass m slides with speed v in the plane in the re¬ 
direction at y = a and collides and sticks to a uniform rod of mass // of length I intially at rest 
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centered at the origin and stretched along the y axis from y — — b to y — b, as shown below: 


y 



( 33 ) 


The center of mass of the final system has y coordinate 

1 


Vcm 


rn + /i 


rn a + /i 0] = 


m a 


m + fi 


(34) 


After the collision, the point on the rod-mass system at y — y cm moves in the x direction with 
constant velocity 

—— (35) 

rn + n 

To determine the rate at which the rod-mass system rotates about the center of mass, we first 
note that the angular momentum about the center of mass is 


fia 

I u — —m v - 

m + yU 


(36) 


The moment of inertia about the center of mass can be computed by adding the contribution from 
the point mass 

/ \ 2 


I m = m(a — y cm ) 2 = m ( ) 

\m + nj 

to that from the rod, which from the parallel axis theorem is 


(37) 


, T 2 fib 2 

7 ^ — 20 + ^ Vcm - + I 1 


m a 


m + n 


(38) 


where we have used the fact that the moment of inertial of the rod about its center of mass is fi b 2 /3. 
Thus the moment of inertial of the rod-mass system about its center of mass is 


/ = m 


H a 

™ + n 


yb 2 

3—^f I 1 


m a 
m + n 




rn + fi 


(39) 
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Putting all this together, we can work out how the system moves by putting together the uniform 
motion of the center of mass with the rotation about the center of mass. This system is animated 
in the Mathematica notebook rodball.nb on the web page. The animation allows you to vary the 
parameters m and a , and also to make the center of mass visible, so you see more easily its uniform 
motion and the rotation of the system around it. I hope that you will play with this one. 

Torque - fixed reference point 

We believe that at the deepest level, the Lagrangian of the world is rotation invariant, and that total 
angular momentum is conserved (at least if the reference point is fixed), like total energy and total 
momentum. But a system that is not isolated from the rest of the universe may have an angular 
momentum that is not conserved, just as it may have a momentum or energy that is not conserved. 
We describe this situation in terms of the concept of torque. The angular momentum of a rigid 
body about a fixed reference point r 0 is 


L ro 

Y (O - r Q ) x pj = Y m j (o - r 0 ) x fj 

(40) 


3 3 


If we then define the torque as 

Tr 0 = Y (O - f o) X Fj 

3 

(41) 

Then 


d L 
Jt ro 

= Y m 3 O xrj + Y m 3 (0 - r 0 ) x fj 

(42) 

3 3 



= Y (o “ f o) X Fj = T ro 

(43) 


3 


The first term on the right hand side of (42 ) vanishes because of the antisymmetry of the cross 
product. 

The fixed reference point r 0 is completely arbitrary. This just corresponds to our freedom to 
choose the origin of the coordinate system. The vector fj — r 0 describes the position of the jth 
mass in a coordinate system with origin r 0 . 

It is also worth noting that the the forces in (43 ) need only include the “external” forces that 
prevent the body from being isolated and that break rotation invariance for the non-isolated system. 
This must be true for general reasons, and we can also see it explicitly if we make some more 
explicit assumptions. 

The explicit calculational reason that Morin describes assumes that we can break up the force 
into forces between “point particles,” that we all the forces involve only two particles at a time and 
that the force between point particles is a central force. The central force assumption is not un¬ 
reasonable because the rotation invariance of the underlying theory implies that the force between 
two point particles in the system must be along the line between them because there is no other 
vector along which it could point. Then we can use Newton’s third law. Thus if F ab is the force on 
particle a from particle b. 

Fab = -F ba oc r a - f b (44) 
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The first equality is Newton’s third law, and the second comes from the assumption of central force. 
But then the contribution to the torque from this pair of forces vanishes because 


(r a ~ r 0 ) x F ab + (f ft - r 0 ) x F ba oc (f 0 - r b ) x (r a - r b ) = 0 


(45) 


This is the argument in Morin. 

But we really don’t need to make such explicit assumptions. The general reason the torque from 
internal forces vanishes is that, as we noted, we believe that in any really isolated system, rotation 
invariance holds and angular momentum is really concerved. Any violation of angular momentum 
conservation arises simply because we have not isolated the system. The internal forces that hold 
the body together do not contribute to the torque, because what “internal” means is that they don’t 
depend on what is going on outside the system. They are invariant under rotations if as we believe 
the full theory of the system in isolation is invariant under rotations. The forces that hold the body 
together simply rotate along with the body when it is rotated. If the external forces went away, the 
internal forces would still be there holding the body together, but there would be no torque because 
the body would be isolated and rotation invariance would not be broken. 

In pictures, what we are doing is separating a part of our system that we call “internal.” 


lextemal; 


internal; • • • 


lextemal 


If we rotate the 


internal and the external parts of the system together, that has no effect on the 
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Lagrangian. The energy doesn’t change. There is no torque. 



But if we rotate the internal parts of the system, while keeping the external parts of the system 
fixed, then the energy can change and there be a torque. But this depends only on the relative 
orientation of what we have called internal and external. We haven’t changed the internal system 
itself. 


external- 
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In fact, we could get the same torque by keeping the internal system completely fixed, and rotating 
the external system in the opposite direction because this doesn’t change the relative orientation of 
internal and external, and because of overall rotation invariance, that is all that matters. Obviously, 
in this case we have not changed the internal system at all, so there is no purely internal contribution 
to the torque. 



Example — oscillations of a hanging rod 


Here’s a simple example of using torque and moment of inertia — consider the small oscillations of 
a system of a solid rod of length £ with uniform linear mass density p pivoted at the top and allowed 
to rotate in a vertical plane in the earth’s gravitational field, as shown (with the rod constrained to 
rotate in the plane of the paper about the pivot at the top): 

(46) 



We can take 6 to be the angle of the rod from the vertical. The moment of inertia / is given by 


/ — Y, nrij | n x f) 


(47) 
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To finish the job we must actually compute the moment of inertia /. We can write the “sum” in 
(47) as 

/ = / px 2 = -pi 3 dx (48) 

Jo 3 

Thus the angular momentum is (out of the paper) 

L = I6= l pt z 6 (49) 

3 

To find the torque, we can use the fact that we can assume that the graviational force from the 
uniform field acts on the center of mass. This is because torque is linear in the position, so the 
integral would do to find the torque by adding up the contribution of all the little masses in the rod 
is the same integral we would do to find the center of mass. 


J r x dF = J r x (— gz ) dm = (^ J rdrn) x (— gz ) = MR x (— gz ) = R x F (50) 


This is 


r = —pig {i/2) sin 6 = —-pi 2 g sin 9 


(51) 


again out of the paper. Thus using the fact the torque is rate of change of angular momentum, we 
can write 

d ■ 1 „ 1 „ 

(52) 


/Ljo = JO = ]-pi z e = -\pi 2 g sin 9 
dt 3 2 


or 


For small 9, this is a harmonic oscillator with angular frequency 

t3g 


LJ = 


2 i 


(53) 


(54) 


Notice, however, that the result is quite different from what we would get for a simple pendulum 
with a mass at the center of mass of the rod. That would have angular frequency 


u = 



(55) 


The effect of the moment of inertia is to add some extra inertia to the system which decreases the 
angular frequency and increases the period of oscillation. 

We could do this somewhat more easily using the Lagrangian, without mentioning torque. The 
kinetic energy is 

\l0 2 (56) 

The potential energy can be taken to be 


P^2 (i-cosfl) 


(57) 
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(the constant 1 is not important, but I guess it is nice to have the energy defined to be 0 at 6 = 0). 
Then the Lagrangian of the system is 


C 



p £ 2 

~Y 


(1 — cos 6) 


and the Euler-Lagrange equation is 


T* 



(58) 


(59) 


Torque - moving reference point 

Things are slightly more complicated when the reference point is moving - when r 0 =4 0. But only 
a little, so long as we look at things in a reference frame that is moving along with the reference 
point. The first thing to note is that this absolutely has to work the same way if 

f = 0 (60) 


because in this case the frame in which reference point is fixed is moving with constant velocity 
and is therefore an inertial frame, just as good as the one we started in. In the moving reference 
frame, positions and velocities are different, but forces and accelerations are all the same, and 
Newton’s laws work the same way. 

The angular momentum in the moving frame in terms of the original r looks like 

4 0 = 51 m j if ~r 0 )x {fj - f 0 ) (61) 

3 

We get the second terms in the parentheses because we are measuring the position and velocity in 
the moving frame and have to subtract the new reference point. 

If the reference point is accelerating, r 0 ^ 0, this is a dangerous thing to do, because Newton’s 
laws don’t work quite the same way in an accelerating frame, but we will see how that works in a 
moment. However, nothing prevents us from considering L ro defined by (61) in this way, so let’s 
try it. Now we can just differentiate and see what happens. 

jL ro = 55 m j fj ~ f ) x (fj - ro ) + 55 m j fj ~ f) x fj ~ f ) (62) 

j 3 

Again the first term on the right hand side vanishes because of the antisymmetry of the cross 
product and this looks just like (42) except for the r 0 term. 

^L ro = f ro - 55 m 3 fj - f ) X r 0 (63) 

There are a couple of important situations in which we can simply ignore the term proportional to 
r 0 . One, as we expected, is when the second derivative vanishes, 

ro = 0 — that is if r 0 is not accelerating. (64) 
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The other is more interesting. We can rewrite the r 0 term using the definition of the center of mass 
as 

- Y m j {rj ~ r 0 ) x r 0 = -M {R - f 0 ) x r 0 (65) 

j 

Evidently this vanishes if 

fo — R — that is if r 0 is the center of mass; (66) 

or if 

(fo — R) is parallel to r 0 so the cross product vanishes. (67) 

Then 

d ~ 

-j-^L ro T ro ( 68 ) 

where the torque r ro 

T'o = Y (r 3 - f 0 ) x 4 (69) 

j 

is computed in terms of the forces in the lab frame. 

Thus we can continue to use t = dL/dt if we take the reference point to be the center of mass 
even if the center of mass is accelerating! The best way of understanding this is something we 
will discuss in more detail next month. We can actually use Newtonian physics in an accelerating 
reference frame if we also introduce what is called a “fictitious force” to make up for the effect of 
the acceleration. Next month, we will argue that we can simply put the r 0 on the other side of the 
F = rnd equation and associate it with such an “fictitious force.” This force behaves exactly like a 
uniform gravitational force (indeed, Einstein would say that it is not really fictitious - it is entirely 
equivalent to a gravitational force - this is the starting point for general relativity - but we won’t go 
there right now). But a uniform gravitational force just acts on the center of mass, and it produces 
no torque about the center of mass. That is why this works. 

Thus if we take the reference point r 0 to be the center of mass R, the rate of change of angular 
momentum about r 0 is equal to the torque about r 0 , even if r 0 is moving. This is a good thing, so I 
will assume that the reference point is the center of mass unless explicitly stated. From now on, I 
will drop the subscript R. 
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Example — Rolling ring 

Here is another example — a ring of mass m and radius r rolling without slipping down an inclined 
plane in a gravitational field. 



This is a system with one degree of freedom. We can specify the position of the ring either by the 
distance £ that the ring rolls, or by the angle 6 through which the ring rolls, and these are related 
by 

£ = r6 (71) 

Let’s do this three ways. First let’s consider the torque and angular momentum about the 
reference point defined by the point where the ring touches the plane. The nice thing about this 
reference point is that the torque on the ring is independent of the force that the plane exerts on the 
ring. It comes entirely from the gravitational force, which as usual we can take to act at the center 
of mass. This point is instantaneously at rest, and the motion of the ring at the instant it is at rest is 
a rotation about this reference point Then the torque is 

mgr sin 0 (72) 

The moment of inertia is given by the parallel axis theoren as 

I = rnr 2 + I C m = 2 mr 2 (73) 

because the mass is all a distance r away from the center of mass. Thus we have 

L = Id = 2 mr 2 9 = mgr sin 0 (74) 
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or 

0 — — sin (f> (75) 

2 r 

This describes uniform angular acceleration. In terms of rQ, the distance down the inclined plane, 
it is uniform acceleration with g sin (j)/2. Note that there is an extra factor of 1/2 compared with 
mass sliding down a frictionless plane with the same angle. This is the effect of the extra inertia 
associated with the moment of inertia. 

I glossed over an important point here, and let me now emphasize it. Our reference point in this 
analysis is not moving. We are looking at the system at a particular time, t, and the reference point 
is the fixed point where the ring touches the plane at that one time — not the moving point where 
the ring touches the plane as a function of time. The way this shows up in the calculation is that 
we computed the angular momentum in the lab frame. If we had been using a moving reference 
frame, we would have had to measure the velocities of the parts of the system with respect to that 
moving frame. That would have been more complicated - and also useless in this case, because the 
point at which the ring touches the plane is accelerating as a function of time. 

Now let’s do the problem using the Lagrangian. As the ring rolls without slipping, its center 
of mass moves down the plane, and it also spins. Thus its kinetic energy is a sum of a linear and 
a rotational term. If we describe the configuration of the system by the angle 6 that the ring rolls, 
the linear distance it moves is r 6 . Thus the kinetic energy is 

^m(rd) 2 + ^I CM 0 2 (76) 

and Icm in this case is just m r 2 because all the mass of the ring is a distance r from the center of 
mass. The gravitational potential is 

— mgrd sin <p (77) 

Thus the Lagrangian is 

m r 2 6 2 + mgrd sin <p (78) 

and the Euler-Lagrange equation is 

2m r 2 6 = mgr sin cp (79) 

which again gives 

6 = — sin (p (80) 

2 r 

Another way to do the problem is in term of torque and rate of change of angular momentum 
about the center of mass. But to do it this way, we need to consider the force F % that the inclined 
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plane exerts on the ring, as shown below. 



The normal component of F t (labeled by N in the diagram) just cancels the normal component of 
the gravitational force, mg cos 0, because the ring doesn’t accelerate off the plane. The tangential 
component of F t (labeled by T in the diagram) is the frictional force that keeps the ring from 
sliding on the incline. Now the acceleration of the ring down the incline is determined by the 
difference between the tangential component of the gravitational force and T, 

ma = mg sin 0 — T . (82) 

But the angular acceleration of the ring is governed by the moment of inertia, / = mr 2 , and the 
torque about the center of mass, which is out of the plane of the paper with magnitude rT, 

mr 2 0 = rT (83) 

Because the ring is rolling without slipping 

a = r6 (84) 


so (82) becomes 


mrO = mg sin 0 — T. 

Multiplying (85) by r and adding it to (83), we can cancel the unknown T and find 


2 mr 2 0 = mgr sin 0 


(85) 

( 86 ) 


which agrees with (79) as it should. 

As we have seen before in other contexts, the Lagrangian technique simplifies things. It allows 
us to solve the problem without introducing and then eliminating the frictional force, T. But we 
were able to do it just as easily by choosing the reference point appropriately 
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lecture 18 


Topics: 

Where are we now? 

Impulse and elastic collisions 
Rigid bodies are weird 
The angular velocity vector 
Impulse and rigid bodies 
The moment of inertia tensor 
An impulsive demo 

Where are we now? 

We began our discussion of rigid body rotations by discussing the simple case of rotations about a 
fixed axis. Today, I will spend a little time discussing in general the important ideas of the angular 
velocity vector and the reference point. I am also going to discuss a nice example of motion 
in a plane, which introduces the idea of impulse that we will use to explore more complicated 
situations. 

Impulse and elastic collisions 

The animation in the Mathematica notebook rodbounce.nb shows a rigid rod in the y-z plane in a 
gravitational field bouncing completely elastically on a frictionless surface. This problem is a nice 
example of the use of impulsive forces and torques to solve problems. I will start by analyzing this 
example to show you how the animation was produced. 

We will assume that the rod is initially either not rotating at all (this is what is shown in the 
animation) or rotating in a vertical plane. If so, the motion stays in the same vertical plane, as long 
as the rod is perfectly symmetrical and the plane on which it bounces is perfectly flat. Thus we can 
analyze it without worrying about the full three dimensional complexity of angular momentum. 
Then we can just choose our coordinate system so that 5 is vertical and the rod is bouncing in the 
y-z plane, as we assumed. 

Except when the rod is actually in contact with the frictionless surface, the motion is extremely 
simple. The rod rotates with some fixed angular velocity uj } — the subscript i is for “initial” (the 
axis x is out of the plane in the x direction) and the center of mass, in the center of the rod rises and 
falls in the constant gravitational field, so that when one end of the rod hits the frictionless surface, 
the center of mass is moving with some velocity v\ in the vertical direction (v — v z ), which will 
usually be negative, but not always, because the rotation of the rod may cause a collision even if 
the center of mass is rising. When a collision occurs, we get a new velocity and angular velocity, 
Vf and ojf (subscript f for “final”). Our job is to calculate Vf and u>f in terms of v, and — then we 
can follow the system until the next collision and do it again, and so on until we get tired. Or better 
still, we can simply program it into the animation and watch the pretty bouncing rod until we get 
mesmerized. 
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Suppose that the rod has mass m and length 2 t. Suppose further that the collision occurs with 
the rod at an angle 9 (between 0 and 7r) from the horizontal, as shown below. 

x r 



( 1 ) 


Also shown is the translational velocity of the center of mass and the rotational motion, tux x r, 
which gets added on to the motion of the center of mass to produce the full motion of the end of 
the rod. 

During the collision, there is a force, F, on the rod from the frictionless surface. Because the 
surface is frictionless, the force is purely vertical, F = F z. Now because the bounce happens 
very quickly, we can ignore the motion of the rod while the bounce is taking place. Then all that 
matters is the integral of the force over the period of the bounce, 

[ dt F — z f dtF (2) 

J bounce J bounce 

This is called the “impulse.” Now the point is that the impulse does double duty. 1 — It changes the 
linear momentum of the center of mass. Because the force is the rate of change of the momentum, 
the impulse is the total change in the momentum: 


m (vf — Vi) = / 

J b 


dt%=[ dtF 

bounce Clt J bounce 


(3) 


2 — It also changes the angular momentum about the center of mass. Because the torque is the 
rate of change of angular momentum, the cross product of the lever arm with the impulse is the 
total change in the angular momentum. The rod has length 2 1, so this looks like 

/ (cuf — u>i) = f dt{rxF) x =(rx f dt f\ = —£cos9 f dtF (4) 

./bounce V -/bounce / x -/bounce 

Let me emphasize again the key step here. Because we have assumed that the bounce takes place 
very quickly, we can ignore the motion of the rod while the bounce is taking place. That allows us 
to take the r out of the integral in (4). This makes the problem doable because the change in v and 
the change in u are related — 


i cos 9m(v f — Vi) + / (cuf — u>i) = 0 


(5) 


In addition to ( 5), we know that energy is conserved. The energy is the kinetic energy in motion 
of the center of mass and rotation. Thus conservation of energy is 


- m v\ + - I uf — - m Vf + - I ujf 
2 1 2 1 2 f 2 f 


( 6 ) 
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At this point, we could plug this into Maple or Mathematica and ask the computer to solve for Vf 
and Uf in terms of V\ and U[. But there is a useful trick involved in doing it by hand, so let’s go on 
a while. First write energy conservation as 

m {vf - v 2 ) + I (cUf — c of) = 0 (7) 

Now we can factor this 

m (V{ - Vi) (v { + Vi) + I (cUf — CUi) (cUf + CU;) = 0 (8) 

and now use (5) to write this as 


m (vf — Vi) (vf + v^ — I cos 9 m {y f — Vi) (cuf + cui) 

— m(v f — Vi) ((vf + Vi) — £ cos 9 (cuf + cui)) = 0 

The trick here is pretty general. We know there is a solution to the twin equations (5) and (6) of 
the form u f = v\ and tUf = tu; because this satisfies (5) and if nothing changes, energy is conserved. 
We have just written (6) so that this solution is manifest. Of course, we are not interested in the 
case where nothing changes because this is not what happens in the collision. There must always 
be some force on the rod during the collision, so we always get a non-trivial change in v and uj. 
But writing (6) this way allows us to eliminate the trivial solution and makes it easier to find the 
interesting one. Thus for the physical solution we are interested in, we must have 

(vf + r?;) — I cos 9 (cuf + cUj) = 0 (10) 


This is now another linear equation for V{ and cuf, so we can easily solve (5) and (10), and the result 
is 

{mi 2 cos 2 9 — 1) + 21 d cos 9 tu; 2 ml cos 9v- x — {mi 2 cos 2 9 — I) cui 

f mi 2 cos 2 9 + 1 f mi 2 cos 2 9 + 1 

It is this that we have used to construct the animation. 

Nothing we have done depends on the precise value of I. For a solid rod, / = mi 2 / 3, but 
we don’t have to look only at that case. It is interesting to look at this for various Is. One very 
interesting limit is I = mi 2 , which corresponds to a dumbbell, with the masses at the ends of 
a light rod. This is animated in the Mathematica notebook dumbbellbounce.nb This actually 
allows us to use our physical intuition to get a nice check of (11). Suppose that for a dumbbell, 
cos 9 is close to zero when the left mass hits the surface. Because the force of the rod on the 
masses is nearly horizontal in this case, it has very little effect of the motion of the two masses. 
Thus we expect the left mass to bounce and simply reverse its velocity, and the right mass to just 
keep going. Now for 9 « 0, the motion of the masses is nearly in the vertical direction and the 
vertical components are approximately 


^left ~ V - iuj n right ttV + i u 


( 12 ) 
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Thus to get uieft to change sign while u rig ht to remain unchanged, we want the bounce to approxi¬ 
mately interchange v and i u. 


Vi 






(13) 


It is easy to see from (11) that this is what happens. 

In the case of the rigid rod, with smaller moment of inertia, the other end of the rod actually 
moves faster after such a nearly horizontal bound. Again a limit may make clearer what is going 
on. The limit / —> 0 corresponds to a mass in the center of the light rod. In this case, for 0 ~ 0, 
we expect the center of mass to keep moving, V{ ~ v t , which again accords with (11). 

Meanwhile, notice that when the rod is rotating a lot, it doesn’t go up as far — this is because 
more of the energy is stored in rotational kinetic energy and there is less in center of mass motion 
after the collision. 

Rigid bodies are weird 

So what’s the big deal about torque and angular momentum. Surely, this is just like force and 
momentum. You push something and it moves (or accelerates, at least). You twist something and 
it turns. But torque equals rate of change of angular momentum implies some pretty remarkable 
things. When you twist a spinning rigid body carrying a large angular momentum, the twist does 
very counter-intuitive things because the twist does not directly change the orientation of the body. 
Instead, what a torque does is to change the direction of the angular momentum. And the direction 
of the angular momentum is tied not to the orientation of the body, which is constantly changing, 
but to the orientation of the rotation axis. A gyroscope is the most familiar example of this. A 
torque that one would naively think would cause the body to fall instead causes it to precess. This 
is very familiar, but it is worth seeing over and over again. Here it is for a simple top. Take a 
bicycle wheel and weight the rim with lead. Get it spinning with speed v. The angular momentum 
is then approximately mvr where r is the radius of the wheel and m is the total mass. This would 
be exactly right if all of the mass were concentrated in the rim at radius r. 

L 



v 
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Now if we apply a torque to the handle, which is the axis of rotation, strange things happen because 
we are actually changing the angular momentum. Precession is one example. 



(15) 


The torque in the diagram is into the paper. Thus the change in the angular momentum is into the 
paper. But the only way that can happen is if the direction of the angular momentum changes — 
and the orientation of the handle must go with the angular momentum — so the system precesses. 
You see that precession is easy to explain in terms of torque and angular momentum, but perhaps 
not so easy to understand in your bones. 

Here is another situation which is basically the same, but which I find even stranger. If I stand 
on a turntable with the wheel axis horizontal, and I try to twist the handle so that the angular 
momentum of the wheel points slightly down, to conserve angular momentum, I will have to start 
spinning in the counterclockwise direction, and develop angular momentum upwards. This is 
pretty weird, because it means that by trying to produce a torque in one direction (horizontal) I 
have actually produced a torque in the vertical direction. 

This is so strange that it seems magical, even though we have all seen it many times. What is 
really going on here?????? 

I find this sufficiently strange that I want to show you how it comes about in a particular very 
simple case. 

Consider a light rigid frame of crossed bars with weights of equal mass m on two of the 
opposite ends. The two masses form a dumbbell rotating in the x-y plane. The cross piece is 
supported in two frictionless sleeves that allow the system to rotate, but can be used to supply a 
torque. This is shown below in the x-z plane: 


velocity v y 
out of paper 



L 


2 mvl z 


velocity —v y 
into paper 


x 

—> z 


(16) 


If at some time t — 0, the upper mass at x — £ is moving in the +y direction with speed v, and the 
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lower one at x = — £ in the — y direction with the same speed, then the angular momentum of the 
system is 2 mvi z. 

Now suppose that at time t — 0, we apply a large torque N in the +y direction for a very short 
time At, short enough that we can neglect the motion of the masses during the time. We actually 
supply this torque by twisting the frictionless sleeve, 



but the effect is the same as applying the forces to the masses (because of the rigidity of the frame) 


F' = 


N 

21 



T?' — K 
1 21 

The magnitude of the force is fixed by the value of the torque. 

Now the torque in (18 ) changes the direction of the motion of the two masses. The change in 
the momentum is F' At. From the top, in the y-z plane, the resulting momentum looks like this 
(with the momenta of the bottom mass dashed and the masses not shown) 

t / 


i / 



Now as the masses move in their new direction, they drag the rest of the rigid body along with 
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them, establishing a new axis of rotation, shown as the dotted line below. 



( 20 ) 


Thus the system must precess. 

Of course, this all really happens at once, but it makes it easier for me to understand what is 
going on in precession if I disarticulate things and think first about a very quick application of 
torque changing the direction of the rotating masses, and then the rigid body forces that hold the 
system together pulling the rest of the system along with them. Over the next couple of weeks (and 
starting later today if I have time), I am going to use this trick in more complicated ways to ease 
our way into the physics of rigid body rotations. 

The angular velocity vector 

We have seen that the velocity of point f) on a rigid body rotating with angular velocity uj about 
an axis n through a point r Q is 

cun x (fj — r 0 'j (21) 

The important point is that u> and n always appear in combination, as the product to h. This is a 
vector with direction n and magnitude to. It is called the “angular velocity vector” 


u = u ft (22) 

Now here is a very important fact. Like any vector, 0 can be taken apart into components. 
Angular velocity vectors can be added and subtracted. This may not sound very remarkable, but 
in fact, ordinary rotations do not work this way. Unlike vector coordinates like the coordinates 
of the center of mass, the quantities that describe the orientation of a rigid body are not vectors. 
They are angles and they do not form a linear space. You can’t add them. The reason is simply 
that the structure of rotations is more complicated than the structure of translations. The order in 
which rotations are done matters to the final configuration. Because order doesn’t matter when you 
add numbers or vectors, that means that rotations cannot simply add. They must compose in some 
more complicated way. 

Here’s a simple example. One good way to specify the orientation of a rigid body is by specify¬ 
ing a reference orientation and specifying the rotation required to get from the reference orientation 
to the actual orientation. A rotation, in turn, can be specified by giving the axis of rotation, and 
the magnitude of the rotation in radians. If you put together two rotations about the same axis, the 
magnitudes just add. But the trouble is that if the axes are different, the combination of the two 
rotations is a rotation about some new axis, by some magnitude that is a complicated function of 
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the angles and axes. Both the axis and the magnitude of the combined rotation depend on which 
of the component rotations is performed first. This is what makes rotation of rigid bodies such a 
complicated and interesting subject. 

For example, consider a regular tetrahedon with the four vertices labeled by different numbers, 
1-4. Looking down on this tetrahedron, it might look like this: 



where a, b and c represent axes through the center of the tetrahedron (each of these vectors is 
coming slightly out of the plane of the paper). 

If I do a rotation by 27 t/ 3 about the a axis, the tetrahedron rotates into itself with the numbers 
on the vertices rotated as 1 —> 2 —>• 4 —> 1, as shown: 



If I now do a rotation by 2n/3 about the b axis, the tetrahedron rotates into itself with the numbers 
on the vertices rotated as 2 —> 3 —> 4 —> 2, as shown: 



The result of these two rotations is equivalent to a single rotation by 47 t/ 3 (or —2tt/3) around the 
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c axis, as shown below 



On the other hand, if I do the same two rotations in the opposite order, something different 
happens. If I first do the rotation by 27 t/ 3 about the b axis, the tetrahedron rotates into itself with 
the numbers on the vertices rotated asl—>3—>2—>1, as shown: 



If I now do the rotation by 27r/3 about the a axis, the tetrahedron rotates into itself with the numbers 
on the vertices rotated as 1 —> 4 —> 3 —» 1, as shown: 



The result of these two rotations is equivalent to a single rotation by 47 t/ 3 (or —2tt/3) around an 
axis pointed down into the plane, as shown below 
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So you see that for finite rotations, the order of the rotations makes a difference. There is no 
way that you can simply add the coordinates of vectors to get these results. Finite rotations are not 
vectors! 

The reason that angular velocities are simpler is that they really only refer to infinitesimal 
rotations — u; is dd/dt. And infinitesimal rotations can be added without causing confusion. 
Technically, the reason that infinitesimal rotations are can be added like ordinary vectors is that 
the depedence on the order of two infinitesimal rotations is proportional to the product of the two 
infinitesimal angles, and can thus be ignored. Thus dO is a vector, even though a finite rotation is 
not. The relation between the angular velocity vector and the motion of the parts of a rigid body, 

w x (fj - r 0 ) (30) 

will be the crucial simple fact to hang onto as we explore the complicated world of rigid body 
rotations. This depends on not only the direction of the axis, but exactly where the axis is, which 
can be specified by specifying any point r 0 on the axis. Any other point on the axis gives the same 
result, because, any other point has the form 

r a = r 0 + ah (31) 

The extra term proportional to n doesn’t affect (30) because u and n are in the same direction so 
their cross product vanishes. You should remember that if you are going to add angular velocities, 
they must be defined with respect to the same reference point r 0 . Otherwise, things get messy. 

Impulse and rigid bodies 

So to follow up the notion that thinking about impulsive torques is easier to understand, we are 
going to spend some time dealing with the following question. Suppose that we are out in space, 
and we come upon a light rigid rectangular frame with masses at the corners, as shown: 



x F 


(32) 
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It is floating at rest, say in the x-y plane, when we hit one of the masses with a hammer, applying 
a large force in the — z direction for a very short time, as shown. This produces an impulsive force 
and torque on the rigid frame. The question is, immediately after the hammer blow, what is the 
velocity of each of the masses? We will not actually solve this problem until next time, but today, 
we will start to explore a crucial component of the answer — we will discuss the moment of inertia 
tensor and the relation between the angular momentum L and the angular velocity Lo. 

But first you might ask — Why have I formulated the problem in this peculiar way? Why not 
just ask for the trajectory of each of the masses for all time after the hammer blow? The answer is 
that these trajectories are MUCH harder to find and to understand than the velocities I have asked 
about. I will begin by explaining why this is so. You may guess that the answer has something to do 
with impulse — the fact that we have applied the force in a very short time so that the frame does 
not have a chance to move while the force is being applied. That is correct, but it is only part of the 
difference. Complications arise because the direction of the angular momentum after the hammer 
blow does not coincide with the instantaneous axis of rotation of the body. When we calculate L 
in terms of u and the parameters rrij and r 3 that describe the rigid body, it just turns out that except 
in very special circumstances, L and u are not in the same direction. This fact will cause us lots 
of grief when we try to calculate the actual trajectories, and I want to postpone the worst of it. In 
fact, in this course, we will not actually ever solve for the full trajectories in this case, although 
we will do so in some interesting and very non-trivial examples. But just finding the velocities 
of the masses right after the hammer strike is not so bad. We do this by studying carefully the 
vectors p (linear momentum), L (angular momentum), and u (angular velocity) and understanding 
the relationships between them in detail. We will find a peculiar relationship depending on a 
complicated object called the moment of inertia tensor. We will explain this as cleverly as we can, 
but this is one of those cases in which no amount of cleverness can make it look really simple. 
There are times when you just have to be very careful and let the mathematics carry you along. I 
hope that going over it in several different ways will help you get a feeling for this difficult subject. 

The moment of inertia tensor 

Let’s go back to our expression for the angular momentum of a rigid body about a point R, 

L = X! fa - R) x Pj = Y m 3 (U -R)x{fj-R) (33) 

3 3 

where the sum runs over the various massive parts of the system, labeled by the index j. We will 
most often be interested in the case where R is the center of mass, but that is not necessary. All 
we are going to assume in (33) is that the motion is entirely due to the rotation of the body about 
the point R. That is, we are calculating the angular momentum in a frame in which the point R is 
fixed. Now we can calculate this for a rigid body rotating with angular velocity vector uj around R 
by using 

(fj — R) — uj X {fj — R ) (34) 
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Putting (34) into (33 ) gives 


L = ^2 uij ( fj — R ) x (u x (fj — R)j (35) 

3 

Notice that the components of L are just linear combinations of the components of Co, but the 
coefficients are some complicated looking sums. 

To get a feel for these scary looking sums, let us begin by looking at a case when there is just 
a single term. 

L — m\ (fj — R) x (u x (fj — R )) (36) 

This describes the angular momentum of a single particle of mass mi at the point fj rotating about 
the point R with angular velocity to. And let’s simplify further by putting R = 0 (this is just a 
choice of origin anyway, so it doesn’t cost anything). Then (36) becomes 

L = mi fj x (Co x fj) = — fj x (fj x Co) (37) 

We can understand what (37) is by decomposing to into pieces perpendicular to and parallel to fj, 


Co = Co _L + tO \| 

Now the parallel part does not contribute at all because of the cross products, so 

L = —mi fj x (fj x to±) 


(38) 


(39) 


But now each of the cross products rotates the vector to± by 90° in the plane perpendicular to fj 
and multiplies the magnitude by |fj|. Thus if we do this twice we get — |fj| 2 c7_i_, so (39) becomes 

L = —mi fj x (fj x d;_i_) = mi |fj| 2 dl± = mi (fj ■ fj) c7_l (40) 

Thus the angular momentum is not so complicated in this case. It is just mir\ times the perpen¬ 
dicular component of Co. However, already in this simple case, Co and L are not parallel! We saw 
this in a prs question a few lectures ago. 



In the diagram, Co is in the z direction, but L is perpendicular to r in plane formed by z and r. For 
the motion of a single point mass, while Co determines the velocity of the mass, the velocity does 
not uniquely determine to. to± gives the same velocity as to, and it is to± that matters for L because 
it is perpendicular to f. 
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The notation is (40) is not good enough to allow us to generalize this to the case of more than 
one j. The problem is that uj± depends implicitly on the direction of fj. To put this back into (35) 
we have to make this dependence explicit. To do that, note that the parallel component of uj can be 
written as 

£|| = n (f i • u) (41) 

and thus from (38), we can write 

uj± = uj — fi (f i • uj) (42) 

Combining (40) and (42) gives 

L — mi (fi ■ fj) uj± = m\ (fj ■ fj) {uj — fi (f i ■ uj)^) = mi (fj • fj) uj — rri] fj (fj ■ t u) (43) 


Putting the arbitrary reference point R back in and factoring out the factor of mi gives 


m i 


((fl - R) • (fj - R )) uj - (fj - R ) ((fj - R) ■ d;) 


(44) 


Now all the dependence on fj is completely explicit, and we can apply the same procedure to each 
of the terms in the sum in (35). The result is 


L = J2 m J [(if - R) ■ fj -R))w~ fj -R)((fj -R) 


■°) 


(45) 


Equation (45) is the desired relation between L and uj. However, it is useful to write it in a 
different form. The first thing to notice is that both sides of (45) are vectors, and that the right hand 
side is proportional to the vector uj, but is not, in general, in the same direction as uj. The second 
thing to notice is that the second term in (45) involves a dot product. We can think of this as the 
sum that appears in a matrix multiplication. This allows us to write (45) in a slightly more useful 
form in matrix notation. First define a matrix B as follows: 











Bxx 

B X y 

B xz ^ 




B = 

E 

m 3 (rj 

-R) 

(rj ~ 

R) 

= 


Byx 

Ryy 

By Z 





3 






\ 

B Z X 

B zy 

B zz j 




(Vi 

-R 

\x [ r j ~ 

R)x 

[fj 

-R 

x [ r j 

-R 

y fj 

-R). 

«fj - 

- R 

E m 3 

\?i 

-R 

]y [fj ~ 

R)x 

fj 

- R 

y fj 

-R 

y fj 

— R ]y fj - 

- R 

j 

V [ri 

-R 

]z [rj - 

R] x 

fj 

-R 

Z fj 

-R 

y fj 

-R] 

* fj - 

- R 


(46) 


(47) 


We have used a notation of two vector signs to indicated this matrix (Dave, in his book, just uses 
bold face and you have to figure out from the context whether you are dealing with a vector or a 
matrix). In general, this is not a good thing to do, but for our purposes in this course, it doesn’t 
do any harm to use a slightly sloppy notation in which we think of our matrices as “bivectors.” 
In fact, it will be kind of helpful. The reason is that the matrices we will have to deal with are 
symmetric, so we don’t have to be careful to distinguish which vector index is which. You can see 
the symmetry from the definition of B, because 


Rxy By X , 


Rxz Rzx 1 


Ryz Rzy 1 


(48) 
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Then we can think of the dot product in (45) as matrix multiplication, and just indicate it as a 
dot product (this is actually the notation that Mathematica uses). Then the second term in (45) can 
be written as 

- B-Co (49) 

The first term in (45) involves the trace of the matrix B 


m i (fj - R) 2 = Tr B = B xx + B yy + B zz (50) 

3 

So we can combine the two terms in (45) as a single matrix equation, 


L = I-u 


(51) 


where / is the matrix 


/ B XX Byy H - B ZZ 

0 

0 ^ 

: / B XX 

B x y 

B xz \ 

1 

0 

B XX Byy B ZZ 

0 

Byx 

Byy 

Byz 

1 (52) 

V 0 

0 

B XX “I - Byy H“ B ZZ / 

\B ZX 

B zy 

B zz j 

1 


or 


+ B z 


I = I -B 

-B z 


yx 


B x y 

B xx + B z 


~B X 

-B 


yz 


-B 


zy 


B xx + B 


yy - 


(53) 


Like the matrix B, the matrix / is symmetric 


Ixy lyx i lxz *zx j lyz Izy • 


(54) 


Thus / is completely determined by 6 numbers, 


L XX ) 


L yy ? 


L ZZ ) 


l xy j 


1 XZ ) 


(55) 


The object / is a thing called the “moment of inertia tensor.” The term “tensor” refers to a 
large class of objects in mathematics that are generalizations of vectors. We are not going to get 
into tensors in general. This particular tensor is fairly simple because it can be thought of as a 
machine for taking linear combinations of the components of one vector (u) to get another vector 
(L). Equations (51) and (53) show that you can also think of / as a 3 x 3 matrix, and think of the 
right hand side of (51) as a matrix product of the matrix / and the vector u. Because (51) is an 
equation for a vector, it has a meaning independent of the particular coordinate system in which 
we describe the vectors. But like a vector, the tensor / will have different components in different 
coordinate systems. What makes the moment of inertia tensor a tensor is that when we go from one 
coordinate system to another, the components of / transform in the just the right way so that (51) 
is still a correct vector equation. That is the meaning of the two vector signs on I. This is implicit 
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in (51), in which one of the vector signs on / is associated with the vector nature of L on the left 
hand side, while the other vector index is combined by the dot product with the vector index of Co 
so that the right hand side behaves like a vector. This will be very important in our understanding 
of rigid bodies. 

The general point about tensors is that one can actually go backwards through this argument 
and argue that / (because it takes a vector into another vector) behaves under rotations like a 
product of two vectors. That doesn’t mean that it is a product of two vectors, just that it rotates the 
way a product of two vectors rotates when we go from one coordinate system to another. 

In term of /, we can quickly derive for rotation about a fixed axis, n, the component of angular 
momentum in the n direction. In general, the h component of L is 

h ■ L = h ■ I ■ Co (56) 


but for a body that is rotating around the h axis, Co = con and so 

h ■ L — n ■ I ■ Co — n ■ I ■ loti = uon ■ I ■ n (57) 

But we showed last week that u times the moment of inertia about the axis ft, /,,, is to times the 
component in the h direction of angular momentum associated with angular velocity u h. Thus 
(57) implies that 

In = h ■ I ■ h (58) 

This means that our tensor / contains all the information about the moment of inertia about any 
possible axis! 

Let’s see what / looks like for the example system if we take the reference point R to be the 
center of mass. If the rectangular frame has sides of length t and C, and we center the object in the 
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z — 0 plane, the positions of the mass are as shown below: 


3 = 4 

:-f/2,r/2, 


3 = 3 
-f/2..-t'/2. 



3 = 1 

m,£'/2,0) 


3= 2 

(£/2,-£'/2,0) 


(59) 


The center of mass (the vector /?) is at the origin, by symmetry or explicit calculation 

^ rrijfj = ( 0 , 0 , 0 ) 

j 


Then we have 


rrij — m, fj — R = (±£/ 2,±£ / /2,0) 


(60) 


(61) 


where the ± signs in (61) are independent, running over all four possibilities as j runs from 1 to 4. 
Now we can compute the various components of the moment of inertia tensor: For example 


L = £ m, (ft - Rf - (r, - R)l) 


= £ "*j (( r j - + ( T i - R)T) = m 


,/2 


lyy = E m i (( r i - #)* + ( r i - R )l) =mZ 2 
j 

Ixy = lyx = — m j (( r j ~ R)x{ r j — R )y ) = 0 


( 62 ) 


Izz Ixx ~ Iyy Ixz lyz Izx Izy 0 
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In matrix form, this looks like 


/ = 


/mi ' 2 

0 

V 0 


0 

m £ 2 

0 



(63) 


m {£ 2 + t u ) 


Notice that the vanishing of I xy is slightly of non-trivial. It follows from the form of (61), but 
requires a cancelation between different terms in the sum. This happened only because we were 
clever (or at least not stupid) in our choice of coordinate system. 


An impulsive demo 

An impulsive force is one that is applied for a very short time. To get a large change in momentum 
in a very short time requires a very large impulsive force. Applying a very large force is a good 
way of breaking things. Here is an example. 
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lecture 19 

Topics: 

Where are we now? 

The moment of inertia tensor / 

L and Co 

The body frame 

Where are we now? 

We are poised to take the next big step into understanding the motion of rigid bodies. By putting 
together the ideas of impulsive torque and the angular velocity vector, we can solve some more 
interesting problems. Last time, we introduced the moment of inertia tensor. This time we will 
briefly review this and go on to apply it in interesting ways. 

Last time, we introduced the following problem. Suppose that we are out in space, and we 
come upon a light rigid rectangular frame with masses at the corners, as shown: 



x F 


( 1 ) 


It is floating at rest, say in the x-y plane, when we hit one of the masses with a hammer, applying 
a large force in the — z direction for a very short time, as shown. This produces an impulsive force 
and torque on the rigid frame. The question is, immediately after the hammer blow, what is the 
velocity of each of the masses? We began this analysis last time by defining the moment of inertia 
tensor and the relation between the angular momentum L and the angular velocity Co. 


The moment of inertia tensor 

Let’s begin by reviewing what we said last time about the moment of inertia tensor. It may be 
comforting to note that the scary definition we arrived at last time, and will look at again today is 
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not the way we will usually calculate I. At the end of today’s lecture, we will begin to discuss 
easier ways. But it is important to understand where it comes from so we will review it and apply 
it to our floating frame problem. 

Let’s go back to our expression for the angular momentum of a rigid body about a point R, 

L = E if ~ R ) x Pi = E m i (f - R ) x (fj - R ) (2) 

3 3 

We showed last time that we can rewrite this as 


L = 


3 


mi 


(if ~ R) ■ (f -R))w- {f - R)({f - R) ■ u) 


(3) 


Equation (3) is the desired relation between L and uj. However, it is useful to write it in a different 
form. The first thing to notice is that both sides of (3) are vectors, and that the right hand side is 
proportional to the vector uj, but is not, in general, in the same direction as uj. The second thing to 
notice is that the second term in (3 ) involves a dot product. We can think of this as the sum that 
appears in a matrix multiplication. This allows us to write (3) in a slightly more useful form in 
matrix notation. 

First define a matrix B as follows: 


= E 


m.< 









( B xx 

Rxy 

B xz \ 

i 


B = 

E 

rrij {fj 

-R) 

(rj - 

R) 

= 

Ryx 

Ryy 

Byz 
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\ 

\b zx 

B zy 

B zz ) 

I 


f [rj 

-R 

]z [ r j ~ 

R]x 

[fj ■ 

-R 

\x[r 

j — R 

] y [f 

-R\. 

E [ r j ' 

- R 

ft 

-R 

]y i r j ~ 

R] x 

If ■ 

- R 

}y[r 

) ~ R. 

\y if 

-R)y[f - 

-R 


-R 

]z [rj - 

R]x 

If ■ 

-R 

1 z[f-R\ 

\y [ r j 

-R], 

* if ' 

-R 


(4) 


(5) 


We have used a notation of two vector signs to indicated this matrix (Dave, in his book, just uses 
bold face and you have to figure out from the context whether you are dealing with a vector or a 
matrix). In general, this is not a good thing to do, but for our purposes in this course, it doesn’t 
do any harm to use a slightly sloppy notation in which we think of our matrices as “bivectors.” 
In fact, it will be kind of helpful. The reason is that the matrices we will have to deal with are 
symmetric, so we don’t have to be careful to distinguish which vector index is which. You can see 
the symmetry from the definition of B, because 


B X y Ryx i 


B xz B zx i Ryz Rzy ■ 


So we can combine the two terms in (3) as a single matrix equation, 

L — I ■ uj 


( 6 ) 

(7) 


where I is the matrix 


f B X X “I” Byy + B zz 

0 

0 ) 

( B xx 

B X y 

B xz \ 

° 

Bxx "I - -Byy B zz 

0 

J “ Byx 

Byy 

Byz 

V o 

0 

B X X Byy ~\~ B zz ) 

\B ZX 

Bzy 

B zz ) 


( 8 ) 
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or 


I 



(9) 


Like the matrix B, the matrix / is symmetric — 
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Thus / is completely determined by 6 numbers, 



(ID 


The object / is a thing called the “moment of inertia tensor.” The term “tensor” refers to a 


large class of objects in mathematics that are generalizations of vectors. I was not planning on 
talking about tensors in general, but there were so many questions about the concept on the QA, 
that I felt I had to say something about it. So here is a definition of “tensor” that makes sense to a 
physicist (mathematicians would say things much more abstractly). A tensor is anything that can 
be written as a linear combination (that is just a sum with coefficients) products of vectors. 
We have talked a lot about what makes a vector a vector, which is the way it transforms under a 
symmetry transformation. And we know that there are different kinds of vectors depending on what 
symmetry we are talking about. The 3-vectors you are familiar with from high school transform 
under rotations in 3-dimensional space, but we have also now seen 4-vectors that transform under 
Lorentz transformations. And there are other things are useful as well. The moment of inertia 
tensor is a sum of products of ordinary 3-vectors. There are different kinds of tensors depending 
on how many vectors you multiply together. The number of vectors in the product is called the 
“rank” of the tensor. A vector is a rank 1 tensor. Just like with vectors, you can add tensors of the 
same rank together and they are still tensors of the same rank, and you can write tensor equations 
in which both sides are tensors of the same rank. And indeed this is a good thing to do, because 
then if the equation is true in one coordinate system, it is automatically correct in all coordinate 
systems. 

You can see how this works explicitly in (4) and (5) where you see that each component of B 
(which is a rank 2 tensor) is a linear combination of products of components of the vectors fj — R. 

The reason that I didn’t want to spend time on this is that it is a little more complicate to explain 
why the first term in (8), 



( 12 ) 


is a tensor. This doesn’t look like a linear combination of products of two vectors, but actually we 
can write it as 


A = ( B xx + Byy + B zz )(xx + yy + zz) 


(13) 
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This is special and important feature of the identity matrix. So in fact, both terms in (8) are linear 
combinations of products of two vectors, and so / is a rank 2 tensor. 

Rank 2 tensors, like the moment of inertia tensor, are special because they can be written as 
matrices and thought of as a machine for taking linear combinations of the components of one 
vector (to) to get another vector (L). “Matrix” is just a generic term for something with rows and 
columns. The particular matrices that we care about are rank 2 tensors. Equations (7) and (9) 
show that you can also think of / as a 3 x 3 matrix, and think of the right hand side of (7) as a 
matrix product of the matrix / and the vector Co. Because (7) is an equation for a vector, it has a 
meaning independent of the particular coordinate system in which we describe the vectors. But 
like a vector, the tensor / will have different components in different coordinate systems. What 
makes the moment of inertia tensor a tensor is that when we go from one coordinate system to 
another, the components of / transform in the just the right way so that (7) is still a correct vector 
equation. That is the meaning of the two vector signs on I. This is implicit in (7), in which one of 
the vector signs on / is associated with the vector nature of L on the left hand side, while the other 
vector index is combined by the dot product with the vector index of Co so that the right hand side 
behaves like a vector. Incidentally, for the mathematically inclined, note that taking a dot product 
in this way is general way of reducing the rank of a tensor. If we just multiplied the 9 components 
of / with the 3 components of Co in all possible ways, we would get 27 components of a rank 3 
tensor 

C = I Co (14) 

But by taking the dot product, we can get rid of two vectors and make a rank 1 tensor - that is a 
vector. This will be very important in our understanding of rigid bodies. 

In term of I, we can quickly derive for rotation about a fixed axis, n, the component of angular 
momentum in the n direction. In general, the n component of L is 

n ■ L — n ■ I ■ Co (15) 


but for a body that is rotating around the n axis, Co = ton and so 

n ■ L — n ■ I ■ Co — n ■ I ■ toft — to n ■ I ■ n (16) 

But we showed last week that uo times the moment of inertia about the axis h. In, is uo times the 
component in the h direction of angular momentum associated with angular velocity uo h. Thus 
(16) implies that 

In = h ■ I ■ h (17) 

This means that our tensor / contains all the information about the moment of inertia about any 
possible axis! 

Let’s see what / looks like for the example system if we take the reference point R to be the 
center of mass. If the rectangular frame has sides of length t and V, and we center the object in the 
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z — 0 plane, the positions of the mass are as shown below: 


3 = 4 

:-f/2,r/2, 


3 = 3 

-/:/2.-r/2. 



3 = 1 

m,£'/2,0) 


3= 2 

(£/2,-£'/2,0) 


(18) 


The center of mass (the vector /?) is at the origin, by symmetry or explicit calculation 

R ^ rrijfj = ( 0 , 0 , 0 ) 

j 


Then we have 


rrij — m, fj — R = (±£/ 2,±£ / /2,0) 


(19) 


( 20 ) 


where the ± signs in (20) are independent, running over all four possibilities as j runs from 1 to 4. 
Now we can compute the various components of the moment of inertia tensor: For example 


L = £ m, (ft - Rf - (r, - R)l) 


= £ "*j (( r j - + ( T i - R)T) = m 


,/2 


lyy = E m i (( r i - #)* + ( r i - R )l) =m £ 2 

j 

Ixy = lyx = — m j (( r j ~ R )x{ r j — R )y ) = 0 


( 21 ) 


Izz Ixx ~ Iyy Ixz lyz Izx Izy 0 
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In matrix form, this looks like 


/ = 


/mi' 2 

0 

V 0 


0 

m£ 2 

0 



( 22 ) 


m {£ 2 + 1 ") 


Notice that the vanishing of I xy is slightly of non-trivial. It follows from the form of (20), but 
requires a cancelation between different terms in the sum. This happened only because we were 
clever (or at least not stupid) in our choice of coordinate system. 


L and 0 

The important thing about the moment of inertia tensor is equation (7), which describes the con¬ 
nection between the angular momentum vector and the angular velocity vector. Because / is a 
tensor, and not simply a number, (7) implies that these two vectors are generally not in the same 
direction. There is a very nontrivial relation between them that depends on the form of I. 

One example of the non-trivial connection between L and u is the fact that if one of these 
vectors is constant, the other is generally changing with time. For example, imagine that we take 
the rectangular frame we have been discussing and run a frictionless rod through the centers of 
masses 1 and 3, as shown below 
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will have an angular velocity vector along the rod, as shown below: 



of L, we multiply u by /, 

(L x \ /ml' 2 0 0 \ (u x \ / m£ ,2 u> x \ 

L — I L y I = I 0 ml 2 0 I [ Uy I = I ml 2 LU y (25) 

\L Z ) \ 0 0 m(£ 2 + £' 2 ) / \lu z / \m{l 2 + £' 2 )uj z ) 

In this case, ui z = 0, so this has the effect of multiplying the x and y components of 0 by different 
constants, 

L x £xx^x Tn£ UJ X > Ly lyy^x Vfx£ LOy - (26) 
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and this implies that L is in a different direction, proportional to (£', £, 0), as shown below: 



Now as the frame rotates about the frictionless rod, the angular momentum vector rotates with 
it! The component of L in the direction of the frictionless rod, L\\, does not change. Because 
there is no friction, there can be no torque in the direction of the rod, and thus the rate of change 
of this component of L vanishes. That in turn means that the magnitude of u remains constant 
as well. But the rod can and does produce forces on the frame perpendicular to the rod. These 
forces, shown below, produce the torque that causes the perpendicular component of the angular 
momentum vector, L±, to change. This component executes uniform circular motion with angular 
frequency u = |cu| in the plane perpendicular to u. Thus the magnitude of the torque is 

\u x L\ — cu|L_l| (28) 

This business of angular momentum and angular velocity being in different directions may not 
make obvious sense to you. But you should be able to feel this torque in your bones. You should 
be able to see that the accelerations required to keep this system in uniform circular motion are 
in different planes, and that this must produce a torque. Since torque is rate of change of angular 
momentum, you can go backwards and conclude that the angular momentum does not point along 
the angular velocity. I hope this will help you develop intuition for the meaning of the moment of 
inertia tensor. 
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A practical example of the torque that result from angular momentum that is not in the direction 
of the angular velocity is the process of balancing tires for your car. Here is a figure from the web 
(at http://www.discounttire.com/dtc/brochure/info/tireBalance.jsp): 



Static Imbalance: Dynamic Imbalance: 

Occurs when there is a Occurs when there is 

heavy or light spot in the unequal weight on one 

tire so that the tire won't or both sides of the 

roll evenly and the tire tire/wheel assembly's 


The reason that a mechanic spins your tire during the balancing process is check for dynamic im¬ 
balance, which results from mass that is balanced around the center of mass but is not symmetrical 
with respect to the axis. When a tire with this imbalance spins on the axle, the angular momentum 
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is not in the direction of the axle and the perpendicular component of L undergoes uniform circular 
motion with the tire. Thus there is a non-zero torque, u x L which causes the tire to shimmy. 

Now back in the original problem of hitting the frame while it is floating in space, I hope that 
you can now begin to see why in general it is hard to find the trajectories of our masses after the 
system starts to rotate. In this case, after the hit, it is the angular momentum vector that is fixed, 
rather than the angular velocity vector, but the relation between them is still changing in time. 

Let me say this once more in more generality. If a rigid body is rotating freely, the angular 
momentum vector L is fixed in space (because this is an inertial frame - often called the space 
frame),, but the rj-s that determine the position of the masses in the rigid body are determined 
by the orientation of the body, so they are all rotating about the center of mass and constantly 
changing in some way that depends on the angular velocity vector u. We can calculate u from L if 
we know the moment of inertia tensor, I. But to find /, we need to know the r ; s that depend on the 
orientation of the body. Therefore the form of the moment of inertia tensor is changing in time. It 
depends on the orientation of the body, which itself is constantly changing. But this means that u 
is constantly changing. So this is an incredible mess, and it seems circular. What are we going to 
do to extricate ourselves? 

The body frame 

The way out of this dilemma is to think about a frame in which the moment of inertia tensor 
remains simple. The moment of inertia tensor is constant in a coordinate system that is fixed on 
the body, and rotates with it. The moment of inertia must be constant in this frame because / is 
determined by the masses and position vectors that describe their positions within the body, and if 
the body is at rest, these position vectors are all constant. This is called the body frame. The body 
frame is certainly not necessarily an inertial frame. In a week or so, we will see how this modifies 
Newton’s laws in such a frame. But even though it is not an inertial frame, it will be useful to us 
in our analysis of rigid body rotations. 

In the body frame, there is a particularly simple choice of coordinate system that makes the 
moment of inertial tensor look simple. For every body, there are three perpendicular directions, 
described by unit vectors e\, e 2 and e 3 , which have the nice property that for a rotation about 
an axis in one of these directions, the angular momentum is in the same direction as the axis of 
rotation. This is a general result from linear algebra. 1 The axes in the directions e 1; e 2 and e 3 are 
called the principal axes of the body. Then we say that the body has a moment of inertia /, about 
the axis ei, a moment of inertia J 2 about the axis e 2 , and a moment of inertia J 3 about the axis e 3 . 
We can write the tensor / in any coordinate system as a sum over the three principal axes, 

I — ?i 6] ei + / 2 e 2 e 2 + / 3 e 3 e 3 (30) 

'Formally, this is the statement that we can always choose a coordinate system in which a given symmetric matrix 
is diagonal. 
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If we choose a coordinate system in which ei, e 2 and e 3 are the basis vectors, then / is diagonal. 

_ fh 0 0\ 

/ = 0 h 0 (31) 

VO 0 ij 

The moments Ij around the principal axes are called the principal moments. The directions of 
the three principal axes completely specify the orientation of the body. The principal axes in our 
example above are the x, y and z axes. 

In matrix language, the principal axes have the following properties: 

ei • ei = e 2 • e 2 = e 3 • e 3 = 1, <?i • e 2 = e 3 • e 3 = e 2 • e 3 = 0 , (32) 

and 

I • ei = h ei, I • e 2 = I 2 e 2 , / • e 3 = J 3 e 3 . (33) 

This is the mathematical statement of the physical properties that I just described. In linear algebra 
lingo, the vectors ij that describe the principal axes are the eigenvectors of the matrix /, and the 
principal moments, Ij, are the corresponding eigenvalues. 

For rotation about a principal axis, the angular momentum is in the same direction as the 
angular velocity and the moment of inertia is the principal moment. For example for rotation about 
the axis ei, the angular velocity has the form O — uj e\, so (33) gives 

L = I ■ Co = to I ■ ei = u Iiii = Ii to (34) 

so that indeed the angular momentum is just the principal moment times the angular velocity vector. 
You can also see explicitly from (30)-(33) that the principal moments are the moments of inertia 
about the principal axes: 

ei • I ■ ei = h , e 2 • J • e 2 = J 2 , e 3 ■ I ■ e 3 = J 3 . (35) 

This gives us an easy way to calculate the principal moments once we know the principal axes. We 
can use the expression for the moment of inertia around a fixed axis 

h ■ I ■ h — In — TV rrij \n x (fy — R )[ 2 (36) 

i 

to write 

h = Y, m 3 lei x (fj-R )| 2 

3 

h — ^2 m 3 1^2 X ~ R)\ 2 (37) 

3 

/ 3 = ^m i |e 3 x (fj-R )|- 

3 
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If the three principal moments of the object have different values, then the principal axes are 
unique (we will see later what happens with principal moments are equal). In this case, the prin¬ 
cipal axes are a kind of built in coordinate system in the body frame. Giving the directions of the 
principal axes in space specifies the orientation of the body in space. 

Note that in the impulse problem that we started at the beginning of the lecture, we found that 
for the choice of coordinate system we made, / was diagonal. That means that the principal axes 
were the coordinate axes. This is what I meant by saying that we had chosen a coordinate system 
that wasn’t stupid. In fact, we will see next time how to avoid stupid coordinate systems - and this 
will be the best way of actually calculating the moment of inertia tensor in all the cases we discuss 
in this course. 

Finally, for the mathematicians in the class (and the Les Phys fans - which I hope is everybody), 
note that we can regard / as a bilinear form - that is a machine that takes two vectors A and B 
into a number, via the map 

A,B A-I-B (38) 

Because the matrix / is symmetric, this is a symmetric bilinear form — 

A-1- B = B A (39) 

Finally, you can show that if the object is truly three dimensional, and not a mathematical abstrac¬ 
tion like an infinitely thin rod or a point mass, then all the principal moments are greater than zero. 
This means that this is a positive definite, nondegenerate, symmetric bilinear form. 
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lecture 20 


Topics: 

Guessing principal axes - theorems 
Another example - rectangular solids 
Rotation about a principal axis 
When principal moments are equal 
Proof of the relection theorem 
Finishing the impulse problem 
The final velocities 
After the impulse 


Where are we now 

Having described the moment of inertia tensor in the last lecture, we are now going to start to see 
what to do with it. First, let us sum up the most important properties. 

1. The angular momentum L of a rigid body is related to its angular velocity vector u by the 
matrix equation 

L — I ■ uj (1) 

where / is the moment of inertia tensor which in matrix form looks like 


'yy B zz 

Bxy 

—B xz \ 


Byx 

Bxx B zz 

— Byz 1 

(2) 

B zx 

Bzy 

Bxx Byy ) 



where the symmetric matrix B is defined as 

b = m j Co - R ) Co - R ) (3) 

3 

This is physics. It follows from the definition of the angular momentum and the equation 
for the motion of the parts of a rotating rigid body. 

2. At any given time, there are three perpendicular axes 

e 3 for j — 1 to 3 (4) 

called the principal axes such that if 0 is in the e 3 direction, then L is also — 

/ • uij = Ij uij (5) 

We can write the tensor / in any coordinate system as a sum over the three principal axes, 

1 = he 1 e 1 +/ 2 e 2 e 2 +/ 3 e 3 e 3 (6) 
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If we choose a coordinate system in which e\, e 2 and e 3 are the basis vectors, then / is 
diagonal. 

_ (h 0 0\ 

/ == 0 I 2 0 (7) 

Vo o ij 

The moments Ij around the principal axes are called the principal moments. The directions 
of the three principal axes completely specify the orientation of the body. The principal axes 
in our example above are the x, y and 2 axes. 

This is mathematics. The principal axes ei, e 2 and e 3 are the eigenvectors of the real sym¬ 
metric matrix /, and the principal moments I \, J 2 and J 3 are the corresponding eigenvalues. 

3. The principal moment Ij is the moment of inertia about the principal axis e r 

4. The principal axes are tied to the body — they rotate as the body rotates. This is one of the 
things that makes rigid body rotations complicated if u is not along a principal axis. 

Let us now return to our auto mechanic checking your tires for dynamic imbalance. 



Static Imbalance: 

Occurs when there is a 
heavy or light spot in the 
tire so that the tire won't 
roll evenly and the tire 



Dynamic Imbalance: 

Occurs when there is 
unequal weight on one 
or both sides of the 
tire/wheel assembly's 


What the auto mechanics is trying to do is to balance your tire so that the axle is a principal axis of 
the tire. Once that is done, the angular momentum of the tire spinning around its axle is along the 
axle and does not change as the tire spins. Therefore no torque is required to keep the tire spinning 
on it axle, and there is no shimmy! 

Guessing principal axes - theorems 

Now that we know about principal axes, it is useful to discuss the question of finding the moment 
of inertia tensor again. The general formula in (2) and (3) is certainly something you should know, 
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but with any luck you will seldom have to use it. Instead, you can remember a couple of more 
general things. The first thing to notice is that it is MUCH easier to find the moment of intertia 
tensor if you already know what the principal axes are. My recommendation is that instead of 
spending your time understanding the ins and outs of the general formula, 

1. you should understand how to guess the principal axes, and 

2. you should understand how to find the tensor if you already know the principal axes. 

This is a far better way, in practice, to actually compute I. 

Let’s look at step 2 first. Suppose that you know the principal axes. Then you can rotate to a 
coordinate system in which the principal axes are the coordinate axes, e 3 — x, e 2 = y, e 3 = 2 . In 
this coordinate system, / is diagonal and 

I xx /1 ■ lyy f2 ■ f z : f3 ■ (8) 

So for example, in our rectagular example, 


3 = 4 

-e/2, e/2 


3 = 3 

-e/2, —d/2 



3 = 1 

{e/2, e'/2, o) 


(9) 


3= 2 

{e/2, ~e'/2, o) 


if we know that the x, y and z axes are principal, we can immediately conclude that I xx = m £' 2 
because there are four masses each of mass m and each a distance l'/2 from the x axis. 

Now let’s go back to step 1. The primary tool here (it will probably not surprise you) is 
symmetry. There are three theorems that will be useful. 

The equal moment theorem: Any vector in the plane formed by two different principal 
axes of a rigid body with equal moments is also a principal axis with the same moment. 

The reflection theorem: If a rigid body is invariant under reflection in a plane, the vector 
perpendicular to this plane is a principal axis. 
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The rotation theorem: If a rigid body is invariant under a rotation of a < 2n about an axis 
n, then (1) n is a principal axis, and (2) if a ^ n, all vectors in the plane perpendicular 
to n are principal axes with the same principal moment. 

We will prove these later. Let’s first see how to use the reflection theorem to guess the principal 
axes of our frame example. 

This theorem immediately implies that the principal axes in our example problem are the x, 
y and z axes, because the system has three planes of symmetry, the x-z and y-z planes (shown 
below) and the x-y plane (the plane of the paper) 

symmetry 



( 10 ) 
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Another example - rectangular solids 

Consider a rectangular solid with height h, width w and depth d, with a uniform mass density, 



The principal axes are the obvious ones — parallel to the edges, perpendicular to the faces. Again 
the reason is symmetry. The body is symmetrical under reflections through planes through the 
center of the body, parallel to any of the faces. So by our theorem, the vectors perpendicular to the 
faces are principal axes. 

Now we can calculate the principal moments by calculating the moments of inertia about the 
principal axes . For example, moment of inertia about the axis e\ can be written as an integral 



Rotation about a principal axis 

If a rigid body rotates freely about one of its principal axes, the angular momentum is in the same 
direction as u. This makes it easy to understand the physics. Suppose that the principal axis e\ is 
lined up with the 5 direction, and uj is also in the z. Then the angular momentum is also in the z 
direction and is given by 

L = lid; = Ii\lu\ z (13) 

Conservation of angular momentum then tells you that uj remains constant as well. So the body just 
rotates with constant angular velocity about its principal axis, which remains in the same direction 
in the space frame. The other two principal axes, of course, are not fixed in the space frame. They 
rotate around with the body. This is illustrated in the Mathematicci file rectangle.nb, which shows 
physically sensible motions of a rectangular rigid body. The blue line in the animation represents 
the angular velocity vector and the angular momentum vector, which are in the same direction. No 
problem! 

But if at some time the angular velocity is not lined up with any of the principal axes, then the 
situation is much more complicated. Now the fact that the angular momentum is constant doesn’t 
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immediately tell you about the angular velocity, because the components of the moment of inertia 
tensor / are changing with time in the space frame because of the rotation. But that in turn means 
that Co is changing with time. Later, we will try to understand this much more complicated motion 
for a symmetric top. 

Before we get started on this difficult path, I want to show you some allowed motions of rigid 
bodies that LOOK complicated, but actually are not. We will do that in the next section. 

When principal moments are equal 

Something interesting happens when two of the principal moments are equal, as in a rectangle with 
w = d 7 ^ h. In this case, 

h = h = ^(h 2 + d 2 ) = ^(h 2 + w 2 ) and I 3 = ^(d 2 + w 2 ) = ^d 2 (14) 

In this case, the principal axis corresponding to the unequal moment is unique, but the body 
is happy to rotate about any axis in the plane formed by the two principal axes corresponding to 
equal moments. The reason is the equal moment theorem: Any vector in the plane formed by 
two different principal axes of a rigid body with equal moments is also a principal axis with 
the same moment. Let’s prove this. It follows because of the linearity of the fundamental equation 
that defines the principal moments. Suppose for example that I x = J 2 = I. Then 

I ■ ei — I ei and I ■ e 2 — I e 2 . (15) 

but that means that any linear combination of c\ and e 2 satisfies the same equation: 

/ • (aiei + a 2 e 2 ) = cq / • rfi + a 2 / • e 2 = Gq /ei + a 2 /e 2 = / (Gqei + a 2 e 2 ),. (16) 

But with an arbitrary linear combination, we can get any vector in the plane spanned by eq and e 2 . 
This completes the proof of the equal moment theorem. 

Notice that the same linearity argument, (16), works even if the principal axes e\ and e 2 are not 
orthogonal, that is e± ■ e 2 jC o, so long as the two vectors are not equal. Indeed, the only time one 
can have principal axes that are not orthogonal is when they both have the same principal moment, 
because only then can we take linear combinations using (16) to make orthogonal axes and thus 
satisfy our general theorem that the momentum of inertial tensor can be made diagonal by going 
to the right coordinate system. Actually, (16) shows that in this case, the coordinate system is not 
unique. Any orthonormal pair of vectors in the plane formed by e i and e 2 is a perfectly good basis 
and leads to the same form for /. 

A corolary of this and the general fact that every body has three perpendicular principal axes 
is that if you find two principal axes that are not perpendicular to one another, you know that their 
principal moments must be equal even before you calculate them. 

The rotation theorem is closely related to this. If a rigid body is invariant under a rotation 
of a < 2 tt about an axis n, then (1) n is a principal axis, and (2) if a ^ tt, all vectors in the 
plane perpendicular to n are principal axes with the same principal moment. 
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Let’s prove (1) first. 

First note that there must be some principal axis e.\ with a nonzero component in the n direc¬ 
tions. Then we can write 

e.\ — an + bp (17) 

where p is some unit vector in the plane perpendicular to ft. Under the rotation by a about n, n 
doesn’t change, but p rotates in the plane perpendicular to h. Because of the symmetry, each such 
rotation must give a principal axis with the same moment as e\. Then there are three possibilities: 

1 . b = 0, in which case e\ oc ft and (1) is true; 

2 . b ^ 0 and a = it, in which case p changes sign under the rotation so that e\ = an + bp and 
e\ = an — bp are both principal axes with the same moment - but the e.\ is in the plane of 
two principal axes with the same moment and is a principle axis, and again (1) is true; 

3. b ^ 0 and a = n, in which case a seqence of rotations gives at least three linearly indepen¬ 
dent principal axes with the same moment, so any vector is a principal axis, and again (1) is 
true. 


Then (2) follows because if a ^ tt, we can look at a principal moment perpendicular to ft and it 
rotates into a linearly independent vector perpendicular to ft, and then the equal moment theorem 
gives the result. 

A body with two equal principal moments is called a symmetric top because one simple way 
to get two equal moments is to have a continuous symmetry that rotates e\ into e 2 , as in a top like 
that shown below: 



(18) 


In the case of a symmetric top, it is obvious that any vector in the plane of the disk is a principal 
axis, because all these directions are physically equivalent. We can rotate one into another. Note, 
however, that as we have seen above such a symmetry is not necessary. As long as Ii = J 2 , any 
vector in the plane of e\ and e 2 is a principal axis. In the rectangular solid with d — w, these 
directions don’t all look equivalent, but they are equivalent so far as rotations are concerned. Thus 
for example a rectangular solid with d = w can rotate happily about any axis perpendicular to 
its length. This rotation looks more complicated than the rotation about an axis perpendicular to a 
face, but really it isn’t. An example is shown in the Mathemcitica file cylinder.nb, which illustrates 
that the behavior of such a rectagular solid with respect to rotations is the same as that of a cylinder. 
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A nice symmetrical body like a sphere has all three principal moments equal, 

h = h = h = I =\mr 2 (19) 

5 

where m is the mass and r is the radius. For such a symmetrical object, the angular momentum is 
always in the direction of the angular velocity 

L = I Co (20) 

As with the symmetrical top, (20) does not depend on the body actually being rotationally sym¬ 
metric. It depends only on the fact that /i = / 2 = h- F° r example, a solid cube is clearly not 
invariant under arbitrary rotations. However, it does have Ii — I 2 — I 3 because it is symmetric 
under discrete rotations by 90° about lines perpendicular to the faces. This is enough to guarantee 
that the angular momentum is parallel to the angular velocity for a cube. So a cube is happy to 
rotate about any axis at constant angular velocity, whether or not the axis is lined up with one of 
the special axes in the cube. The same is true of any other regular solid, as in the Mathematica file 
regular-solid.nb. The three moments of inertia are equal. 


Proof of the reflection theorem 

Here is the theorem again. If a rigid body is invariant under reflection in a plane, the vector 
perpendicular to this plane is a principal axis. Here is the proof. Suppose we have a plane of 
reflection and we construct the vector perpendicular to the plane. Some principal axis must have 
at least some component along this vector. But then because of the symmetry, we know that if we 
reflect this axis in the symmetry plane, we must get another principal axis with the same principal 
moment, as illustrated below: 



Now there are two possibilities. One possibility is that the reflected axis coincides with the original. 
The only way this can happen is for the original axis to be perpendicular to the symmetry plane, 
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in which case the theorem is true. The other possibility is that the reflected axis is different. 
But then there are two distinct axes with the same principal moment (because they are physically 
equivalent due to the symmetry). But then any axis in the plane formed by these two is a perfectly 
good principal axis — in particular the one perpendicular to the symmetry plane — so again the 
theorem is true. 

This is illustrated in Mathematica file axes2.nb. There is a slight subtlety that is also illustrated 
in this file. A reflection is not equivalent to a rotation by 180°. And in fact, symmetry under a 
rotation by 180° doesn’t tell you anything about principal moments in the plane of the rotation. 
Symmetry under a rotation by 2njn for n > 3 is enough to guarantee that any vector in the plane 
perpendicular to the axis of rotation is a principal axis with a principal moment independent of the 
angle in the plane. But this is not true for n — 2 — symmetry under a rotation by 7r or 180°. This 
only tells you that the axis is a principal moment. 

Finishing the impulse problem 

Let’s review where we are with the light rigid rectangular frame with masses at the corners, as 
shown: 


j = 4 

-£/ 2 ,£'/ 2 , 


3 = 3 

-£/ 2 ,-£'/ 2 , 



3 = 1 


3 = 2 

(£/ 2 ,-£'/ 2 , 0 ) 


( 22 ) 


where the force acts for a very short time, and the impulse is into the paper, with the form 

J dt F = —z P (23) 

for positive P. Earlier, we found the center of mass to be at the origin and we found the moment 
of inertia tensor for this object about its center of mass to be 


I xx tti 




Iyy = m£ 2 


£ ZZ £ XX + I ■ 


yy 


I xy I xz Iyz 0 


(24) 
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To complete the story, we need to compute the velocity of the center of mass, and the angular 
velocity about the center of mass. The velocity is easy. We know the impulse, and we know the 
total mass, so we can compute the change in momentum and thus the final velocity. We’ll do this 
in a minute. First we will do the interesting part and compute the angular velocity. 

To do this, we first compute the impulsive torque about the center of mass — that is 

J dtf (25) 

over the short period during which the force acts. This is 

J dt(r 2 -R)xF = (e/ 2 , -i!/2, 0) x (0, 0, -P) = (Pi'/ 2, Pi/ 2, 0) (26) 

where we have used the fact that the force acts for a very short time, so that r 2 and R do not change 
significantly while the force acts, and they are therefore effectively constants and can be taken 
outside the integral in (26). This is equal to the change in angular momentum about the center of 
mass, which is the final angular momentum about the center of mass. 

L = (Pg/2, Pi/2, 0) (27) 


But the angular momentum is related to the angular velocity by 


L = I ■ uj 


(28) 


Because L is conserved after the force acts, (28) remains true for any time afterwards. But here 
is the point. Immediately after the force acts, the body has still not had a chance to move from its 
position in (22). Therefore, the moment of inertial tensor is given by (24), which implies (using 
(24) and (27)) 

(29) 


(Pi'/2, Pi/2,0) = (mi' 2 u) x ,mi 2 ujy,T zz uj 


which in turn implies 


P 


P 




2 mi' UJy 2 mi 


uj z = 0 


(30) 


This is the desired result for the angular velocity about the center of mass immediately after the 
hammer blow: 

W= (2^ , W , °) (31) 

The directions of the vectors L and uj are shown below (in an arbitrary relative normalization, 
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because they have different units). 


j = 4 

-1/2,e/2 


3 = 3 

-i/ 2 , ~e /2 



3 = 1 

[1/2, i'/2, 0 ) 


J = 2 

{i/ 2 ,-e/ 2 ,G) 


(32) 


It is amusing (though not particularly signficant) that cu points at mass 1. 

(31) is very easy to understand in the limit l — i'. In this case, the frame is a square, the x 
and y principal moments are equal, so we can take the principal axes to be the diagonals, and (31) 
simply describes rotation about the obvious diagonal, shown as the dashed line below: 



(33) 
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The final velocities 


Now to get the velocities of the individual masses, we use the fact that the motion of the body is a 
combination of the translational motion of the center of mass plus the rotational motion about the 
center of mass. The final velocity of the center of mass is the impulse divided by the total mass: 




The velocity of mass j is then given by 


V + Co x (fj — R) 


where 0 is given by (31). Here they are: 


* = U-£V P p 


4 m ) \ 2m £' ’ 2m £ 




v 2 — 0, 0, — -— + 


P\ ( P P 


Am) \2m£' : 2m£ 


,0 x 


2 ’ 2 


,0 


^3 — f 0 , 0 , — -—) + 


p p 


Am ) \ 2m £' ’ 2m £ 

=(°'°--£ 


■° X -2’-2’° 


^ 4 — 0 , 0 , — -— + 


P\ ( P P X / OB! 


Ain J \2m£ r 2m£ 


■° x r 2 ’ 2 ’° 


(34) 

(35) 

(36) 

(37) 

(38) 

(39) 


After the impulse 

I keep saying that things get so complicated after the impulse that I don’t want to require you to 
understand the general case. But I think it may be useful to see how we would use Mathematica to 
study it. Mathematica allows us to follow the motion numerically in a pretty simple way, because it 
gives us simple tools for dealing with rotations. Let’s look at the notebook after-the-impulse.nb. 

The key is to keep track of the rotation matrix R(t) (called r in the notebook) that rotates the 
body from its initial orientation to its orientation at time t. If we know this, and we know uj(t) 

(called u in the notebook), then we can calculate R(t + dt ) — 

R(t + dt) = RotationMatrix[|u;| dt, lu] ■ R(t) (40) 
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where RotationMatrix[0,u;] (abbreviated as rm in the notebook) rotates by 6 about the vector u. 
This is just translation into matrix language of our cross product formula for the motion of a vector 
rotating with angular velocity u. But we can also find 0 from the fixed angular momentum L 
because we can find the moment of inertia tensor in terms of R(t), 


_ -» -» -> T 

I(t) = R(t ) ■ 1(0) ■ R(t) 


(41) 


Then 


^ -l ^ 

u(t) = I(t ) ■ L 


(42) 


That is how the animation works. We look at the motion after the impulse in a frame moving along 
with the center of mass. The green line shows u and the red line shows L. Afterlt wanders all over 
the place as the body tumbles out of control. 
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lecture 20 


Topics: 

The free symmetric top in the space frame 

Vectors in the body frame 

The free symmetric top in the body frame 

Euler’s equations 

The free symmetric top ala Euler’s 

The tennis racket theorem 

As you know, a spinning top subject to a gravitational torque will precess. In fact, the situation 
is a bit more complicated. If the top doesn’t get started quite right, the motion is more complicated. 
The precession of the symmetry axis about the direction of the angular momentum that we saw in 
our discussion of the free symmetric top gives wise to the phenomenon of nutation in precessing 
symmetric tops. We will see how this works, and then go on to see what we can say about the 
general case by looking at it in the body frame. 

The free symmetric top in the space frame 

Now we are finally going to face the music and try to figure out analytically what happens AF¬ 
TER the initial impulse when the angular momentum and the angular velocity are not in the same 
direction. However, I am not going to be so cruel as to try to completely solve the problem in 
the case of a general rigid body. As I said, that is too hard. But we will analyze completely only 
the case of the free precession of a symmetric top, with two equal moments of inertia. It not just 
that I am taking pity on you. I think that it is possible to understand this in a way that is probably 
impossible for the general case. Since I am interested in getting you to really understand things, 
rather than just cramming facts into your head, I will focus on the simpler problem. I am going 
to do this in a very slick way. We won’t need any complicated differential equations — just the 
relation L — I ■ u and the connection of the cross product to uniform circular motion. I hope that 
you will find it so simple that you will be able easily to reproduce it yourself, though you may find 
yourself wondering what happened. 

Later, we will go back and do it the hard way. 

Suppose that a rigid body with two equal principal moments (this makes it a symmetric top) is 
floating out in space, and is rotating freely about its center of mass. Because there are no external 
torques on the system, the angular momentum L is conserved. If L points along a principal axis 
of the body, then the angular velocity vector iu is parallel to L and it also remains constant. The 
interesting mathematical question is what happens to to when it is not parallel to L. The physical 
question is then what the resulting motion looks like! 

You should not be surprised that the key is the connection between the angular momentum and 
the angular velocity through the moment of inertia tensor — 

L = I ■ cu (1) 
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The important point is that in the space frame, the principal axes of the top, e 2 , e 2 and e 3 , are all 
moving around as the body rotates, but at any given time, they form a complete orthonormal set of 
vectors in terms of which we can expand any vector. For example 

cu = ei (ei • lo) + e 2 • co) + e 3 (e 3 • lo) = coi e\ + lo 2 £2 + ^3 63 ( 2 ) 

which just says that Co can be expanded in the basis vectors, e^. Using (2) in this basis, (1) becomes 

L = Ii e\ + I 2 lo 2 &2 + I 3 cu 3 e 3 (3) 

The relation (3) is entirely general, valid for an arbitrary moment of inertia, but here, we are dealing 
with a symmetric top, for which two of the principal moments are equal. Let’s assume that the two 
equal moments are I\ and J 2 , with corresponding principal axes e\ and e 2 - As you know from 
the equal-moment theorem, this means that any axis in the plane of e.\ and e 2 (which is the plane 
perpendicular to e 3 ) is a principal axis with the same principal moment. To remind you of this, I 
will define I± as 

I± = h = h ( 4 ) 

Then we can write (3) as 

L = Jj_ U\ e\ + ij_ 002 e-2 + h <^3 £3 ( 5 ) 

Another way of deriving (5) is to use our dyadic expression for the moment of inertia tensor, 

I = he 1 e 1 + I 2 e 2 e 2 + h hh (6) 

which for a free symmetric top looks l ik e 

I — h e-ih + 7j_ e 2 e 2 + I± e 3 e 3 (7) 

Now because of (4), we can eliminate all reference to e 2 and e 2 which enormously simplifies 
the problem. If we now multiply both sides of (2) by I± and subtract both sides of the result from 
(5), we get 

L = I± U 3 \ e\ + I L <^2 ^2 + I3 ^3 &3 

- [ /j_ Co = /j_ uj\ €\ I±_ co 2 e 2 + I± i 03 e 3 ] 

L — /_[_ to = (I 3 — /_l) cu 3 e 3 
or 

L = I±u + (I 3 — I±)co 3 e 3 = I±co + I± Ue 3 ( 9 ) 

where 

n = ( io) 
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There is an ambiguity in the sign of e 3 because the symmetry axis is not directional, and we resolve 
this by always taking CU 3 > 0. The constant Vt has units of angular velocity, and it always satisfies 
the inequality 

—CU3 < f) < 6U3 (11) 

because 

—I± < I'i — I± < I± (12) 

We will talk more about this in a general way soon. 

It will sometimes be convenient to rewrite (9) as an equation for to, 

cJ = L/I ± - f)e 3 (13) 


The relations (9) and (13) have nothing in them beyond the connection between angular mo¬ 
mentum and angular velocity and the fact that I\ = / 2 = I±. They are true at any time, but do not 
tell us directly about time dependence. However, it is the simplicity of these relations that allows 
us to solve this problem much more easily than we could for a general rigid body. Mathematically, 
it is the equality of I\ and J 2 that allows us to eliminate both e\ and e 2 from (9) and (13). Physi¬ 
cally, this is particularly important because the one remaining principal axis in (9) and (13) is the 
one we see in the typical symmetric top. The symmetry axis is special, and that makes (9) and (13) 
particularly useful. 

We can now get the time dependence by judiciously using the connection between rotation and 
the cross product. 

The first step is to notice that (>3 is a principal axis fixed in the body, and it is therefore rotating 
with the body. Like any other vector fixed in the body, e 3 has a time dependence given the the 
cross product with the angular velocity vector u. 


d , _ „ 

—e 3 = uxe 3 
at 


(14) 


The relation (14) is a differential equation for the time dependence of e 3 , but it is not particularly 
useful, because to is also changing with time in a way that we do not yet know. Instead of trying to 
bull our way through this, we can use (13) to write 


d „ 





L „ 

x e 3 = — x e 3 


(15) 


This is a much more useful equation, because L is constant. In fact, we don’t even have to solve it, 
because we already know exactly what it means geometrically. The time dependence of e 3 given 
in (15) is what we expect if €>3 is rotating about an axis in the direction of L with angular velocity 
L/I±. Since L is constant in the space frame, this must be what is actually going on for all times 
— e 3 is undergoing uniform circular motion with angular velocity L/I±. Now we are essentially 
done. From this, we can calculate everything. 

First notice that the component of the angular momentum in the e 3 direction (along the symme¬ 
try axis of the top), L ■ e 3 is constant. Geometrically, this is because e 3 is rotating around the fixed 
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vector L, so while its component perpendicular to L is constantly changing, its parallel component 
is not. Analytically it follows from (15) as follows: 

= = 7 Z X ‘ 3 ) =0 <16) 

But L ■ e 3 is related to lu 3 by (5), 

L ■ e 3 = I 3 uj 3 (17) 

Thus (16) means that uj 3 is constant, 

j f u 3 = 0 (18) 

This in turn means that Q is constant also, so the coefficients in (9) and (13) are constant in time. 
Note also that using (17), Q can be written as 

n= \L} h- cosQ ( 19) 

I± h 

where 9 is the angle between L and the symmetry axis (between 0 and 7t/2). 

Because of (13) and (18), we know that uj is a fixed linear combination of L and e 3 , and 
therefore since e 3 is undergoing uniform circular motion with angular velocity L/I j_, then uj is 
also. We can see this geometrically because (9) or (13) and (16) imply that L, e 3 and uj are fixed 
in a plane and then from (15) we can conclude that the plane formed by L, e 3 and uj rotates about 
the fixed angular momentum with angular velocity L/I±. We can also see this analytically 


d _ 
—u 
dt 


( 20 ) 

d 

dt 

(- L/I± - ^e 3 ) 

( 21 ) 

d 

dt 

(-fie 3 ) 

( 22 ) 

L 

= I± 

x (-fie 3 ) 

(23) 

L 

= H 

x (L/I _ l - Ue 3 ) 

(24) 

L 
= I± 

X UJ 

(25) 


Notice that we have still not figured out explicitly what the perpendicular axes e 3 and e 2 are 
doing. But this information is implicit in the relation (13). In words, this relation together with 
(15) says that the angular velocity of the top can be taken apart into two components: 

1 . the rotation of the top around its symmetry axis e 3 with angular velocity — Q e 3 ; and 

2. the rotation of e 3 around the fixed angular momentum with angular velocity L/I±. 
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Let’s review the few key steps in this derivation. First we used the form of the moment of 
inertia tensor to eliminate 

Let us now look at the result in animated form in the Mathematica worksheet freetop.nb This 
is an animation of a rotating rectangular solid, with two opposite sides square, so that it has two 
equal principal moments. This is an allowed free rotation of the body in space. The angular 
momentum is L = L z and is constant. The ratio of the length in the e 3 direction to that in the ei ;2 
directions starts at 2 with I 3 /I± = 0.4 less than one, corresponding to a prolate object — long and 
thin along the symmetry axis. The angle 9 is the angle between the angular momentum (the 5 axis) 
and e 3 , the symmetry axis of the body. The initial 9 is zero. Then u = z L/I 3 . 

Now work your way towards larger initial angles. If you switch from “solid” to “frame” you 
see three lines representing vectors in addition to to rotating solid. The green line is the angular 
velocity vector, uj. The vertical purple line and the red line are the components of the u vector 
along the directions of the angular momentum L and the symmetry axis e 3 . Thus these vectors are 
a visual representation of the fundamental relation between L, uj and e 3 , expressed in (9) and (13). 

I hope that these animations will help you to understand more deeply the connection between 
(13) and (15). The rate of rotation of the plane is always \L\/I±, which is fixed in the animations. 
One way of describing the crucial relation (13) is that we have taken the angular velocity vector 
apart into a component along e 3 that describes the rotation of the object around its symmetry 
axis and the vertical component L/I± that describes the rotation of the plane of the symmetry 
axis. Writing Co this way is great, because you should be able to see both components physically. 
Notice that for the prolate object, for an initial angle less than n/2, uj is longer than L/I. This 
is prolateness in action - the moment of inertial about the long axis is smaller than Jj_, so (15) 
implies that the magnitude of the angular momentum must be larger than \L\/I±. The spin around 
the axis and the spin around the axis of the angular momentum add positively to the total angular 
velocity. For an oblate object, the opposite is true. The angular velocity vector is shorter than 
\L\/I±, which means that the spin around the axis is in the opposite direction. But in this case 
as well, the plane of e 3 and u rotates about the fixed angular momentum, which has the physical 
effect that the symmetry axis precesses. It is worth staring at this animation for a long while. 

Vectors in the body frame 

The derivation we gave at the beginning of the lecture of the motion of the free symmetric top in 
the space frame is slick and (I hope) easy to understand, but limited to the case I\ = / 2 . While we 
will not solve the more general problem completely, it is interesting to set it up mathematically in 
the body frame, rotating along with this rigid body. At least, in this frame, the moments of inertia 
do not change. We will take the origin to be the center of mass in both the space frame and the 
body frame. We will also take our coordinate axes in the body frame to correspond to the three 
principal axes of the rigid body. Then the equations that govern the motion of the body take a fairly 
simple form. Of course, even if we can solve these equations, it doesn’t tell us everything, because 
it will still take some more work to understand how the body is moving in the space frame. But at 
least this is a start. 
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Our starting point will be the relation between the change of a vector with respect to the space 
frame and the change with respect to the rotating body frame. This is going to be important for 
other things as well, so we will pause to go over it slowly. 

Consider a vector A. We will call the rate of change of the vector with respect to the body 
frame 


5A 

~6t' 


(26) 


We have invented this funny symbolism to avoid getting this confused with the rate of change of 
the vector with respect to the space frame, for which we continue to use the conventional symbols, 


dA 

dt 


(27) 


By definition, if ^ = 0, the vector is fixed with respect to the body frame, and then we know 
that its change with respect to the space frame is simply u x A where uj is the angular velocity of 
the body. If ^ 7 ^ 0, then the change with respect to the space frame is the sum of this and the 
contribution from the body’s rotation: 


dA 

dt 


5A 

~5t 


T cu x A. 


(28) 


What I find confusing about (28) is that it seems to refer to two different frames — the space frame 
and the body frame. In which frame is it true? The answer is in either one! The quantities on 
both sides of the equation are vectors that can be expressed either in the space frame or in the body 
frame. The explicit coordinates of A and its derivatives and 0 will change when we go from one 
frame to another, but (28) is true in either. This is why I was careful to use the words “with respect 
to” rather than the word “in” in defining ^ an d 4?. These objects are the change of the vector 
A with respect to the space frame and the body frame respectively, but they can be described in 
either frame (or for that matter, any other coordinate system). 

There is something very subtle and interesting going on here which I often get confused about 
myself. It is worth trying to say this in a few different ways because it gets into some issues that 
are usually not discussed in enough detail. Bear with me, because I am still trying to figure out 
the best way to say this. In Newtonian mechanics, we tend to use the terms “coordinate system” 
and “frame” interchangeably, and for example talk about a “moving coordinate system.” This is 
potentially confusing because we get into the problems we have just been discussing. I think that 
a better way of saying things in nonrelativistic Newtonian mechanics is to talk about a coordinate 
system only as the system we use to describe the components of vectors at on particular time. A 
“frame” is then something that specifies what coordinate system to use at each time. If we try 
to do this consistently, we see that it makes no sense to talk about the time derivative of a vector 
without specifying the frame, because the components of the time derivative involve the difference 
between to components of the vector at two neighboring times which must be specified in the 
coordinate system that the frame tells you to use at those two different times. Any time we write 
a time derivative of a vector (or tensor), we must specify a frame. However, the result of 


6 



the differentiation in a particular frame at a particular time is a vector that has an invariant 
meaning, and can be described in any coordinate system at that time. 

Curiously, this almost comes more naturally in relativity. If we think of the notion of “frame” 
more physically, as we are forced to do once velocities get close to c, we are naturally constrained 
to do the right things, because we have to specify HOW we actually measure the coordinates of our 
vectors. Of course this means that we need to have a coordinate system that everyone in the frame 
agrees on at each different time. The fact that time gets mixed up with space in relativity forced 
us to this more physical picture of a frame, but in fact we really need it just to keep from getting 
confused even at low speeds, at least if we are thinking about rotating or otherwise accelerating 
frames. 

Let’s discuss an example. Consider the vector u itself. Here (28) implies 


duj S uj Suj 

—— — ~r~ T UJ X UJ = . 

dt St St 


(29) 


The rate of change of u with respect to the body frame is the same vector as the rate of change of 
omega with respect to the space frame. First consider the situation in which u = uih is constant in 
space. This means that the body is rotating about the fixed axis n with constant angular velocity uj. 
In the body frame, this axis is fixed in the body, and again uj is constant. Thus 


duo „ Suj 
—— = 0 =>- — 
dt St 


(30) 


But the relation (29) may seem a bit odd in general, so let’s see how it works in a nontrivial 
example. Considering the object you studied in problem 9.1 will give us something specific to 
think about and will also illustrate some of the curious features of angular velocity. 



rotation of axis 


/N 

/ \ 

/ \ 

/ 1 _ 

1 1 ^1 

t = 0 

rotation 

1 1 

around axis 

1 1 

space frame 


V / 

\ / 

\ / 


(31) 
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This is intended to show a rigid body made of two perpendicular rods rotating in the space frame 
about one of its axes e\ (the long one — red if you can see the figure in color) with angular velocity 

cci = (32) 

which is constant in the body frame and with the axis e.\ rotating about the vertical with angular 
velocity 

Co 2 = oo 2 z (33) 

constant in the space frame. The total angular velocity is the sum Co = Co\ + Co 2 and the component 
of the angular velocity in the direction of the axis is executing uniform circular motion and the rate 
of change of Co is 

—00 — U0 2 X 00 — 00 2 X (o0\ T" 002) — 00 2 X 00\ (34) 

(If 

In the space frame, as the red axis rotates around with angular velocity io 2 , you see the shorter 
crosspiece rotating around the red e 1 axis with angular velocity Co x . For oj] = 30 and 0 o 2 = 0, 
this is what was shown in the Mathematica notebook as9-rotations.nb that you studied on the 
problem set. 

The animation is based on the fact that our description in words of the angular velocity de¬ 
scribes not just what 00 looks like at a particular moment, but also how it evolves with time (at least 
if we impose an initial position). For example, if the axis e\ is in the x direction at time t — 0, then 
at an arbitrary time 

e 1 — x cos oo 2 t + y sin oo 2 t (35) 

Co 1 = oo\X cos oj 2 t + ooiy sin oo 2 t (36) 

Co — oo\xcosoo 2 t + oo\ysm.oo 2 t + oo 2 z (37) 

From this and (33), you can explicitly verify (34): 

d _ , ^ ^ ^ 

— 00 = — oo\oo 2 x sin oo 2 t + oo\oo 2 y cos oo 2 t = oo 2 z x [oo\x cos oo 2 t + oo\y sin oo 2 t) (38) 

LLL 

Now what does this look like in the body frame? The two frames are shown side by side in 
Mathematica notebook as9-rotations-body-frame.nb. Stare at the animation and imagine that 
you are moving around with the body and try to get a feeling for how you would see the blue line 
representing Co. Naturally, e\ and therefore Coi doesn’t change, because e.\ in defined to be fixed 
in the body frame. But Co 2 , which was fixed in the space frame, is not fixed in the body frame, 
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precisely because of the nonzero cu\, as shown below. 



( 39 ) 


In the body frame, at t = 0, the cu vector looks the same as it did in the space frame. But now at later 
times, the cu vector rotates around the red e\ axis with angular velocity —uu\, so the perpendicular 
component, cu 2 executes uniform circular motion about the red e\ axis with angular velocity —uj\ , 
and the rate of change of the vector cu is 

£ 

—l 0 = —tUi X OJ = — dll X (dll + UJ2) = —<^1 X du 2 (40) 

ot 

which is the same as (34). Again, you can verify this explicitly. In the body frame 

all = cu^x cu 2 — u 2 z cos coit + cu 2 y sin cu\t (41) 

cu = (jj\X -\- u 2 z cos u>\t to 2 y sin ui\t (42) 

d _ „ . „ ^ 

—u = — Uiuj 2 z smujit + t 0 iu 2 y coscuif = a; 2 x (43) 

dt 

Note that the actual functions of t in (42) (43) look different from those in (40) for t > 0, because 
the coordinate systems only agree for t — 0. But they are describing the same vector relation, 
(34). 

Before going on, let me just remind you that what u in the body frame is is NOT the angular 
velocity of the body in a coordinate system fixed in the body. That would of course be zero. Rather 
u in the body frame is the true angular velocity of the body in the space frame but described 
in terms the coordinates of the body frame. When I am talking about the body frame I will try 
to remember to refer to cu as the cu vector rather than the “angular velocity” to emphasize this 
important difference. 

The free symmetric top in the body frame 

I hope that it will not surprise you that our principal tool in analyzing the free symmetric top in the 
body frame will be the general relation between the change of a vector with respect to the space 
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frame and its change with respect to the body frame, (28) 

dA SA _ T 
dt St 

The other thing we need is the vector relation we derived in (13) and (10) between L, 0 and e 3 : 

0 = L/I± — fie 3 (13) 

with fi given by (10) 

n= (h-I±)u , 3 (10) 

I± 

This vector relation is true in any coordinate system. 

We can now calculate how 0 and L change in the body frame. We will do this in two ways. 
First we can use (13) and the fact we derived last time that with respect to the space frame the 
plane of 0 and e 3 is rotating about L with angular velocity L/I± to write 


SO dO L 
St dt I l 


(44) 


Again this is a vector equation true in any coordinate system, so we can use it in the space frame, 
together with (13) to write 


SO 

~5t 


(c o T fie; 


3 X U 


= 0x0 + fie 3 x 0 = fie 3 x 0 


(45) 

(46) 


But because e 3 is fixed in the body frame, (46) implies that with respect to the body frame, 0 is 
rotating about e 3 with angular velocity fie 3 . This is the same trick we used in the space frame, 
except that now it is e 3 that is fixed, so we want to think of the rotation as about e 3 . Note that (46) 
implies that u 3 is constant just as in the space frame because 


5 

St 


(e 3 ■ 0) = e 3 • = e 3 • (fie 


St 


x 0) = 0, 


(47) 


so we can independently derive the fact that the coefficients in in the vector equations (13) are 
fixed. 

Alternatively, we can use the fact that L is fixed in the space frame and write 


„ SL ^ 

0 — ——b lu x L , 
ot 


so that 


SL 

It 


= —0 x L = 


{L/Ia 


fie 3 ) x L = fie 3 x L 


(48) 


(49) 


This expresses the same physics. With respect to the body frame, L, like 0 must rotate about the 
e 3 axis with angular velocity fie 3 because the entire plane in which 0 and L are fixed is rotating. 
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There is a nice way of understanding all this that may help to cement one of the important 
lessons of the last few weeks — while rotations are complicated and do not add like ordinary 
vectors, infinitesimal rotations and angular velocities do add. In this case, what we have with 
respect to the body frame is that the plane formed by the three vectors L, ti and e 3 rotates with 
angular velocity 

ti pb = Qe 3 . (50) 

With respect to the space frame, the plane formed by the three vectors L, to and e 3 rotates with 
angular velocity 

tips = L/ Jj_ (51) 

The relation between the space frame and the body frame is that the angular velocity of the plane 
with respect to the space frame, ti ps , is the angular velocity of the plane with respect to the body 
frame, ti pb , plus the angular velocity of the body, ti\ 

UJ ps (dpb T ^ (^^) 

Indeed, because of (13), this works. Angular velocities add like ordinary vectors. 

Euler’s equations 

There is another approach to the problem of the free symmetric top, which is also more generally 
useful. We can derive what are called “Euler’s equations” by considering (28) where A is the 
angular momentum, L. We can then use the fact that the torque, r, is the rate of change of angular 
momentum with respect to the space frame. Using (28), we can write this in terms of the rate of 
change of L with respect to the body frame. Then we can write 

_ (1L SL ^ -> 

T = Tt = Tt +bJxL <53) 

As we discussed last time, we can evaluate a vector equation in any coordinate system. It is 
interesting to consider (53) in the body frame in which / is fixed. Then in the usual basis in which 
we use the principal axes of our body, e\ e 3 and e 3 , the moment of inertia tensor is 

_ (h 0 0\ 

I= o I 2 0 (54) 

Vo o ij 

and we can express the components of L in this basis in terms of the components of ti as 

L\ = I] uj\ L 2 = 1-2^2 L 3 = I 3 cc 3 (55) 
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We now expand the cross product in (53) to get 


T\ = 


T2 = 


T 3 = 


SL i 

1st 

SL 2 

St 

sl 3 

St 


+ lu 2 L 3 — lu 3 L 2 
+ CU 3 Li — u>i L 3 
+ L 2 — uj 2 L\ 


Now use (55), L 1 — h , L 2 = I 2 u 2 , L 3 


J 3 u :i , to write 


( 56 ) 


Ti — + CU 2 / 3 CU 3 — Ul 3 J 2 CU 2 

t”2 = h —rr + <^3 h — <^1 -^3 <^3 
St 

t 3 — h + <^1 7 2 ^2 — ^2 A <^i 


(57) 


or simplifying slightly 

(h ~ I3) <^2 ^3 

(I 3 ~ h) U 3 cui (~^) 

(Jl — J 2 ) CUi cu 2 

These are Euler’s equations, relating the rate of change of the angular velocity in the body frame to 
the torque and angular velocity in the body frame and the principal moments of inertia. We might 
have expected that the right hand side would be proportional to products of two components of u. 
This ensures that if the angular velocity is in the direction of one of the principal axes, the right 
hand terms of all the Euler equations vanish because two of the components of u are zero. And 
then if the torque vanishes, SO/St = 0, because as we know, the body can rotate freely about a 
principal axis. 


n = h 


Suj\ 

1st 


Su 2 

Su) 3 

T3 = h ir- 
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The free symmetric top ala Euler 

For the free symmetric top, floating out in space and rotating freely about its center of mass with 
h — h — I±, the Euler’s equations take the following form: 


Sui 

■ IF 

5u2 

■ It 
Su, 3 
St 


— (I3 — I±) U3U2 
(I3 — I±) U3UJI 
0 


(59) 


We immediately conclude that u>>, the component of angular velocity in the direction of the unequal 
principal axis, is constant in time relative to the body frame. With then fixed, the other two 
equations take the form 


5uoi 

IF 


(1 3 — I±) ^>3 

F 


CU2 — —1 - 0J2 


Su 2 

~sF 


(h — I±) ^>3 


0J\ — U\ 


(60) 


where 

o _ ih - I±) u 3 
I± 

and 0 is a constant. 

We know that we can rewrite (60) as 


(61) 


- = (0,0,fl)xw (62) 

ot 

In agreement with the previous analysis in (46), this is what we expect for a vector 0 undergoing 
uniform circular motion with angular velocity about the 3 axis. 

You can also see this in a more boring way by solving the coupled differential equations. If we 
put the second of the two equations in (60) into the first, we get an equation of motion for u\. 


5 2 u 1 

~w 


-f ) 2 UJ\ 


(63) 


You can easily check that u; 2 satisfies the same equation. (63) is just the equation for simple 
harmonic motion, with the general solution 


(t) = acos(f2f + 0) 


(64) 
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Then (60) implies 


(65) 


a ) 2 (t) = a sin (Qt + (ft) 

As usual, a and (ft must be set by the initial conditions. So, as we saw directly from (62), (64) and 
(65) describe the two-dimensional vector, (coi{t), co 2 (t)^) undergoing uniform circular motion with 
angular velocity Q. Note that the sign of Q is related to the sign of u 3 , and also whether I_ > J 3 
(which is what we would get for a prolate spheroid) or I± < I 3 (for an oblate spheroid like the 
earth). 

Thus the angular velocity in the body frame is fixed in magnitude, but changes direction, mov¬ 
ing with constant angular velocity around a cone. 

The tennis racket theorem 

Euler’s equations can also be used to understand an interesting fact about rotating rigid bodies with 
no two equal moments of inertia. I’m a tennis player, so the tennis racket theorem is a favorite of 
mine. The tennis racket, in this case, is just a convenient example of a rigid body with three unequal 
moments of inertia. We know that a tennis racket, like any rigid body, is happy to rotate freely about 
any one of its principal axes. For such a rotation, the angular momentum and the angular velocity 
vector are in the same direction. The theorem concerns the nature of free rotations (no torques) that 
begin with an angular velocity almost, but not quite, along one of the principal axes. The statement 
of the theorem is that if the axis has the largest or the smallest moment of inertia, the motion is 
stable and the angular velocity precesses about the axis, never getting very far away. However, 
if the axis has the intermediate moment of inertia, the motion is unstable. The angular velocity 
moves away from the axis exponentially and the motion becomes rather complicated. Here is how 
it goes. Suppose that we consider motion in which u in the body frame is nearly along principal 
axis 3, so that in our nice body frame, u\ and uj 2 are much smaller than u 3 . Euler’s equations look 

= (h — h) W 2 ^3 

= (I 3 -I 1 )co 3 u 1 (66) 

= ( 1 1 — I2) UJ\ 

For the assumed initial conditions, we can generally ignore the time dependence of u 3 , because 
its time derivative is proportional to the product of two small numbers, and u 2 . This argument 
could go wrong if I 3 is very close to either I\ or J 2 , but for a tennis racket in which the three 
principal moments are very different, it is a good approximation. 

If we then treat uj 3 as a constant, the other two Euler equations are linear in the small quantities, 


5uj 2 
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uj\ and u> 2 , and we can write them (approximately) as. 


Differentiate the first of these 

5 2 uji 1 

~W = h 

Now here is the point. If J 3 is 


6uJl - 1 (T T \ 

tt — y (-*2 — h ) ^3 ^2 

dt 1 1 

^2 l , 

-ft — ~r yh — h)u 3 u>i 

dt 1 2 

and use the second to get 

(J 2 — I3) cu 3 -Y = yy — -^i) (-^2 — h) 

dt ii -/ 2 


the largest or smallest moment, then 


CUg UJ\ 


( 67 ) 


( 68 ) 


r \ (/ 3-/,)(/ 2 -/ 3 )^<0 ( 69 ) 

2122 

But then (68) describes an oscillation, aq oscillates about zero and remains small. The same thing 
happens to u 2 . In this case, the motion of the system remains basically about axis 3 with small 
oscillating wobbles. However if J 3 is the intermediate moment, then 


hh 


{h~h){h-h)<4> 0 


(70) 


Then (68) describes an unstable equilibrium and uj\ and u> 2 grow exponentially. Then our approx¬ 
imation quickly breaks down. We can’t tell what happens next without a more detailed analysis. 
But what is clear is that the motion around the axis with the intermediate moment of inertia is 
much more complicated. You can easily see this yourself by playing with a tennis racket, or any 
similarly shaped object. 
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lecture 22 


Topics: 

Where are we now? 

Derivation of fictitious forces 
Precession of tops 
Nutation 
Tidal forces 
Extra Dimensions 

Where are we now? 

This week, the general subject is gravity (and more generally “central” forces) and “fictitious 
forces.” Fictitious forces are convenient constructions that allow us to continue to use F = ma 
in non-inertial frames of reference. This is actually familiar to all of us. Our brain makes the 
same kind of construction whenever we turn a corner in a moving car and we feel a centrifugal 
force. Gravity is also familiar, but seems rather different. Einstein however, would tell us that the 
difference between the two is not so obvious as it seems. It is not an accident, he would say, that 
both forces are proportional to the mass of the particle on which they act. We will begin to see 
why this is important when we talk about the tides. And it will come back again later when we talk 
about cosmology. 

We will probably not get to everthing in the notes in the lecture today, but some of it is better 
to just read about. I will try to take a little time at the end to talk a bit about the subject of “extra 
dimensions” and the connection with gravity, because this crazy idea is sort of fun and something 
that many people are interested in today. 

Derivation of fictitious forces 

The crucial input to the study of fictitious forces is that the motion of an arbitrary accelerating 
frame, like an arbitrarily moving rigid body, can be described at any time by the motion of its 
origin, R, and by its angular velocity of rotation about the origin, uj. Many of the same ideas that 
go into thinking about the physics of rotating rigid bodies can be taken over directly to the study 
of rotating reference frames. The frame of reference in which the rigid body is at rest is obviously 
a good one to think about when discussing a rigid body, and if the body is rotating, then of course 
this frame is rotating too. 

The basic idea of fictitious forces is this. If I insist on using an accelerating and/or rotating 
reference frame, then particles that are moving at constant velocity in free space will appear to be 
accelerating. In order to make F = ma work in such an non-inertial frame, we simply interpret 
this acceleration as being due to a force, called the fictitious force. More specifically, we break m a 
up into a piece associated with the frame and all the rest and move the “frame” part to the other 
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side of the equation, treating it as a force: 
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The fictitious force is always proportional to the mass of the particle, because of the min F — m a. 

There were two basic kinds of questions about this on the QA. Do we really have all the 
fictional forces? AND Why do we call them “fictional”? These are closely related, it turns out. 
We call these forces “fictional” because they can be eliminated completely everywhere by going 
to an appropriate reference frame. If they can be completely elimnated in this way, then in some 
sense they are just due to our poor choice of reference, and not to any “real” physics. But this is a 
little tricky. The earth’s gravity is definitely “real” even though we don’t feel it if we are in a space 
ship in orbit above the earth. What is the difference between a fictitious force and gravity? Both 
of them give a force on a point particle proportional to the particle’s mass. This is why we don’t 
feel gravity in orbit. It’s effect is canceled by the fictitous translational force. In fact, the only way 
we can tell the difference is by looking at that form of the force everywhere. When we define 
a coordinate system, we are laying down a grid that allows us (theoretically - we can’t really do 
this of course) to find the coordinates of any point anywhere in space (remember that we are doing 
Newtonian mechanics and ignoring general relativity). Then we define our frame by specifying 
the coordinate systen at each time. Each coordinate system is a rigid thing — not physically, but 
theoretically. So like a rigid body, it is specified by only six parameters, the three coordinates of 
the origin, and the three angles that determine the orientation of the coordinate axes. This is why 
we know that we have all the fictitious forces. We know from our study of rigid bodies that the 
most general motion of the rigid coordinate system is equivalent to a translation and a rotation, 
so we know the form of our fictitious forces everywhere and we can eliminate them everywhere 
by going to an appropriate frame. Gravity doesn’t work this way. We can eliminate gravity at 
any point by going to an accelerating coordinate frame in which our coordinate system is “falling” 
along with us. But this cancellation of gravity only works at one point. That is why we say that 
gravity is real. 

Relativity (as usual) makes things even more complicated. Once we realize that gravity curves 
space the whole notion of coordinate system is more complicated. But as long as we say away 
from black holes and that sort of thing, we can coninue to use the Newtonian picture at least 
approximately, and that is what we will do. 

Here is a brief derivation of the fictitious forces. It is equivalent to what you will read about 
in Morin’s book, but it breaks the transformation to the accelerating coordinate system up into two 
steps, for me at least, this makes it a little simpler to understand. For a coordinate system that 
is rotating with angular velocity Co about its origin and whose origin is translated by a vector R 
(where Co and R have arbitrary time dependence) with respect to some fixed point in an inertial 
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frame, the fictitious force is the sum of four terms: 


Centrifugal —mu x (u x r ) 

Coriolis —2mCo x v 


Azimuthal 


-mu x r 


Translational 


—m 


d 2 R 
dt 2 


( 3 ) 


As I said, I think it is easiest to demonstrate (3) in two steps. To go from a coordinate system 
fixed in space to one undergoing arbitrary acceleration and rotation, first go from the fixed coordi¬ 
nate system to a translating coordinate system that is not rotating with respect to the fixed system, 
but with its origin accelerating along with the moving coordinate system. We want to know what 
happens to F — ma when we do this. Let’s call the original space frame force and coordinates F s 
and r s , and the translating frame force and coordinates F t and r, so that we begin with 


Then the new coordinates are 


w d2 - 

F , = m- 2 T, 


r = r s — R 


(4) 

(5) 


and we can write 


So that 


d 2 


""dt 27, m dt 2 


d 2 p d 2 

( r s — R I = r s — m 


dt 2 


d 2 


d 2 


Ft = F* — m R = m r 


dt 2 


dt 2 


( 6 ) 

(7) 


Thus the equation of motion in the translating coordinate system is the same as in the space frame, 
except that the force has a new piece - the fictitious translational force. 

It may help to see the relation (5) represented pictorially. In the diagram below the solid axes 
and arrow represent the coordinates of a point in the space frame, r s . The dashed axes and arrow 
represent the coordinates of a point in the translating frame, r. The dotted arrow is R, which as 
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you can see describes the coordinates of the origin of the translating frame in the space frame. 



We can now do the rest of the job, and go from a coordinate system (that we will call the 
translating frame) to another with the same origin, but that is rotating arbitrarily. We will continue 
to call 

d 

i (9) 

the derivative that describes how vectors change with respect to translating frame and will indicate 
the rate of change of a vector in the arbitrary frame by 5/5t as we did in our study of rigid bodies. 
Now we can use the relation that is familiar from our study of rigid bodies, 


dA 

dt 


SA _ T 
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Let A = f and (10 ) becomes 


Now we can differentiate again in 
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the translating frame and repeatedly make use of (11) 
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so finally 


d 2 r S 2 r _ ^ Scu _ 
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Now we will replace the 5/Sts by dots just to make it look simpler. 

d 2 f 


dt 2 


= r + 2u x v + lu x r + lu x (lu x r) 


(15) 


(16) 


where we have used the fact that v — f - the velocity with respect to the arbitrary frame. 

Finally we just have to put (16) into (7) and move the “fictitious” terms to the other side to get 
the equation of motion in the arbitrary frame, 


F a — mf 


where 


d 2 


F n — F„ — rn lu x (lu x r) — 2m luxv — mCuxr — m —— R , 
v ; dt 2 


(17) 


(18) 


which is what we mean by (3 ). We we spend a lot of time next week discussing the physics of each 
of the terms in ( 18). 


Precession of tops 

Let’s first see how precession works in quantitative detail. The system is illustrated below: 



(19) 


Suppose that at some angle 6, the torque is r — into the paper in the figure. We will actually 
ignore the dependence of the torque on 6 and simply assume that the magnitude of the torque is 
constant and equal to r. This will allow us to understand what is happening more easily, and it 
does not change the qualitative nature of the physics, just some boring details. 

Now we will first consider a motion in which the top is started with exactly the right preces- 
sional velocity to keep it at the angle 6. In this case, the angular velocity has two contributions. 


5 




There is a large contribution, u s , in the direction of the symmetry axis of the top — because the top 
is rapidly spinning. There is a vertical contribution, u p , due to the precession caused by the torque. 
Typically, for a rapidly rotating top, the torque is small in the sense that the precession frequency 
Up is much smaller than the angular velocity u s associated with the spin of the top. 



( 20 ) 


From (20), we see that the component of angular momentum in the direction of the symmetry 
axis is 

L\\ = I 3 U 3 ( 21 ) 

where u 3 is the total component of angular velocity along the symmetry axis. 


u 3 = u s + u p cos 9 (22) 

The quantity u 3 is actually somewhat more interesting than u s because it is conserved if the tip of 
the top is frictionless. The component of angular momentum perpendicular to the symmetry axis 
is 

L± = I l u p sin 9 (23) 

What we care about for the precession is the horizontal component of angular momentum, 
which is 

L x = sin 9 Ln — cos 9 L j_ = sin 91 3 u 3 — sin 9 cos 9I± u p (24) 

Normally, we ignore the small second term on the right hand side of (24), and do not distinguish 
between u s and u 3 . But I just wanted to show you that it is possible to take into account the con¬ 
tribution from the precession if we want to. At any rate, the rate of change of angular momentum 
for uniform precessional motion is u p L x , and thus the condition for uniform precession is 

t = u p L x = u p (sin 9 1 3 u 3 — sin 9 cos 9 1± u p ) (25) 


This is a quadratic equation for u p with solutions 


Up 


hu>3 

2 J|_ cos 9 \ 


hu 3 
2/i cos 9 
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r 

sin 9 cos 91± 


(26) 
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The plus sign here gives what is called fast precession, which is precession of the order of the 
rotational velocity of the top. If the torque is small, we can write it as 



(27) 


This is just a slightly perturbed version of the free rotation of a symmetric top that we have just 
understood in detail — the torque plays very little role in it. This is not what we usually see, 
because the top usually doesn’t get started moving that fast. Instead, we are interested in slow 
precession, the minus sign solution, which gives 



(28) 


This is the usual result in which the precession frequency is proportional to the torque and inversely 
proportional to the angular momentum. 


Nutation 


So if the top is moving at exactly the right angular velocity, it precesses at constant angular velocity 
and stays at the same angle. But what happens if things are not quite right? In particular, suppose 
that we are holding the top at some fixed angle and suddenly drop it. This is a typical situation. 
What happens? 

The best way to understand what happens is to go to an accelerating frame that is rotating 
around the pivot point of the top with angular velocity u p . This is not an inertial frame, of course, 
which means that there are so-called fictitious forces that make up for the fact that the frame is 
accelerating. We will talk about these in detail later. At any rate, in this frame, the motion with 
constant 9 and precession rate u p just looks like a top with its symmetry axis sitting still in space. 
This means that in this frame there must be a fictitious force that exactly cancels the torque, so that 
the angular momentum is conserved. I presume that this a basically a coriolis force, but it doesn’t 
matter what it is because it must be there. 

Now what does the process look like in this rotating frame if we drop the top from rest in the 
space frame? In the moving frame, because the symmetry axis of the top was initially at rest in 
the space frame, it is initially rotating with angular velocity — u p in the rotating frame. But that 
means that there is a small additional component to the angular velocity in the vertical direction, 
and thus the angular velocity and the angular momentum of the top are slightly displaced from the 
symmetry axis. Because there is no torque in this special frame, this just reduces the free rotation 
problem that we have already analyzed. The symmetry axis of the top precesses rapidly around the 
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angular momentum with frequency L/I±. Back in the space frame, this motion is superimposed 
on the generally much slower precession of the angular momentum produced by the torque. The 
rapid motion is called nutation. 

Not only is the nutational motion very rapid, but the amplitude is usually very small. In the 
case we discussed in which the top is dropped from rest, the angle of the angular velocity in the 
special frame to the symmetry axis is of order u p /los. For a rapidly rotating top, the motion is often 
to small and too rapid to see. But you can feel it or hear it under the right circumstances. 

As I warned you I would, I have ignored the dependence of the torque on 6 in discussing 
nutation. I hope that you can now see why this doesn’t make much difference. The torque is a 
very small effect that produces the slow precession of the plane of e 3 . The nutation results if the 
angular momentum is not quite lined up with the symmetry axis. These two effects have very little 
to do with one another. Small changes in the torque as e 3 precesses rapidly about L will have a 
very small effect on the motion. 

Tidal forces 

The phenomenon of tidal forces is rather special to gravitation. It is also both physically important 
and a nice example of the Taylor expansion with several variables, so I can’t resist talking about 
it (though I might skip it or put it off til next week if we get behind). The idea is to ask what the 
gravity of a distant object does to a mass on the surface of an approximately spherical planet like 
the earth (here we ignore the oblateness, which has only a tiny effect on this physics). Suppose 
the center of the sphere is at a point r 0 . Now by Newton’s theorem, we know that the gravitational 
force from a distant body on a mass m at r is given by 

— G Mm (29) 

\ r ~ P\ 

where p and M are the position and mass of the distant body, so you might think that all we have 
to do is to evaluate (29) for r on the surface of a sphere, given by \r — r 0 | = R, where R is the 
radius of the sphere. But that is not quite what we are interested in for the gravitational force. The 
gravitational force acts not only on the mass m, but also on the planet, where the forces acts on 
the center. What we are actually interested in is the accelerated coordinate system that is moving 
along with the planet, with the center r 0 fixed. Thus we must add to (29), the fictitious translational 
force associated with the acceleration of the planet due to gravity. The acceleration is 

- G M (30) 

To ~ PV 

and the fictitious translational force is 

GMm (31) 

To ~ PV 

The interesting thing is that this is trying to cancel the effect of (29) and we are only interested in 
the difference, which is very small. 

Ftidai = —G M m + GMm (32) 

\r — p\ 6 |r 0 — p\ 6 
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Since in this accelerating coordinate system, r 0 is fixed, we might as well take it to be the 
origin of our coordinate system, so we take r 0 = 0. This makes the formulas simpler. For this 
more convenient choice, (32) becomes (moving some signs around to make things look simpler) 


^tidai = GMm 


p — r 
\p-f\ 


GMm^- 

\p\ 3 


(33) 


Now we can use the fact that \r | -C \p\ and get an even simpler expression by Taylor expanding 
the first term in powers of r. This is easier if we first have some fun with vector calculus. We can 
write 

^tidai = ~G Mm V p ^ - |^| j (34) 

Where V p is the gradient operator for the vector variable p , 



Odd 

dp x ’ dp y ’ dp z 


(35) 


Equation (34) may look scary, but actually it is the usual connection between a 1/r 2 force and a 
1/r potential. We get the force by taking minus the gradient of the potential. In this case, we are 
interested the difference between two forces - the real one from gravity and the fictitious one from 
the earth’s acceleration, and that is minus the gradient of the difference between two potential. In 
fact, this is a nice example of how a little bit of knowledge of vector calculus can simplify your 
life a lot. If you had to apply the Taylor expansion separately to each component of (33), it would 
be at least three times as much work and the result would be more cumbersome to write down. 

Now we can Taylor expand the term in parentheses in (34), before we take the gradient - and 
this is a lot easier 


1 

I P~r 




2 r 


p + r 2 ^j 


- 1/2 



1 r-p 1 \ 

¥\ + W + '"^W\) 


(36) 


The lowest order term in the Taylor expansion of (36) cancels. That is the point — a constant 
gravitational force does nothing if one is falling along with it. The first order term is 


( 1 1 \ r-p 

vW-^T" ¥\) “ W 

Putting this back into (34) gives 


-Ptidai = -GM mV p 


r • p 

w. 


Acting on the r in the numerator, the gradient operator in (38 ) gives 


(37) 


(38) 


— G M m 



(39) 
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while acting on the denominator it gives 


G M m 


3 p(r-p) 
IpT 


(40) 


Thus 


-^tidal 


G M m 
P 5 


3(r • p) p — p 2 r 


G M m 
P 3 


3 (r ■ p) p — r 


(41) 


What does this mean? Remember that we are using a coordinate system with the center of the 
earth as the origin. Thus r is a vector from the center of the earth — that is it is straight up out of 
the ground. And p is a vector from the earth to the body producing the tidal force. Notice that the 
sign p in (41) doesn’t matter. This slightly unintuitive result is one of the more amusing features 
of the tidal force. It is the same on the side closest to the body producing it as on the other side. 
We will come back to this and try to understand it qualitatively in a moment. 

If r and p are parallel, the force is in the positive f direction — which means outward. If r and 
p are perpendicular, it is in the negative f — which means inward. Elsewhere, it is not parallel to 
f. For 

3(r • p) 2 = p 2 r 2 (42) 


when the cosine of the angle between r and p is 1/ \/3, it is perpendicular to f (the angle is about 
55°). 

To get a more quantitative description, let us take p in the z direction and look in the x-z plane 
through the center of the earth. Then we can write 


f = cos 9 z + sin 9 x 


(43) 


and 

Ftidai oc 3(p ■ r) p — r — 3(5 • f ) z — r — 2 cos 9 z — sin 9 x (44) 

It looks something like this (this is a slice — I have tried to show what it looks like in 3D in 
the Mathemcitica file tides.nb, but I am not sure this has been a real success): 



( 45 ) 
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You can see how this produces the tides, pulling the ocean towards and directly away from the sun 
or moon. 

Let’s see if we can understand in our bones why it is that the tidal force is outward, away 
from the center of the earth, both on the side of the earth closest to the sun, and on the side farthest 
away. 1 The point is that there are two effects — the gravitational force of the sun, and the “fictitious 
force”, that is the effect of the accelerating coordinate system. At the center of the earth, the two 
effects cancel, the 1/r 2 nature of the gravitational force implies that on the side close to the sun, 
the gravitational attraction wins and the net tidal force is towards the sun, while on the other side, 
because the gravitational force is weaker, the fictitious force wins, and the tidal force is away from 
the sun. 

This rather odd looking tidal force is characteristic of gravity in free space. It depends crucially 
on the remarkable feature of gravity that the same quantity — mass — appears in the strength of 
the force and in the inertial term in Newton’s law. It is this that causes us to subtract the force on 
the center of the earth, to produce what we see in (45 ). It was this bit of magic that gave Einstein 
the clue to construct gravity as a geometric theory. 

Extra Dimensions 

Extra dimensions are fun. We probably won’t have too much time to discuss this in lecture today, 
so I thought I would refer you to an amusing journalistic description of what is called the brane 
world picture. 2 A braneworld is a modern version of flatland. There are two ideas. The first is this. 
Suppose that there are space dimensions beyond the usual three. It doesn’t make any sense that 
they are just like the usual three because we would know about them, so suppose that while these 
extra dimensions are there, we are stuck on the usual three. All the matter that we are made of, and 
the electric (and strong and weak) forces that make us work the way we do, everything is confined 
to the usual three dimensions. Now you should object that we have done nothing at all by positing 
the existence of the extra dimensions, and you would be right. Such extra dimensions would be 
uninteresting, because we would never see them. But let’s try to make something interesting about 
the fact that we have mentioned before that gravity is much weaker than the other forces. Perhaps 
matter and the non-gravitational forces - electromagnetism, the strong force and so on, are confined 
to the usual three dimensions, but gravity gets out into the extra dimensions. This is potentially 
interesting. 

What we mean precisely by saying that gravity gets out into the extra dimensions is that the 
field lines of gravity are not confined to our three dimensional space, but spread out into the extra 
dimensions as well. 

This is the basic picture of a “brane-world.” The term “brane” is sort of a generalization of 
“membrane” and it refers to the usual three space dimensions living in a higher dimensional space. 
Gravity lives in the full higher dimensional space because it is somehow related to the geometry 
(this is a bit vague, I know). 

'The argument is a little easier to give for the sun, so we will do that, though the moon works the same way. 

2 If the link doesn’t work for you, I have included the file on the web page as braneworlds.pdf. 
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But now you should object for another reason. If the field lines of gravity spread out into the 
extra dimensions, you would expect the force to fall off faster than 1/r 2 as the distance increases. 
But we know that gravity is a 1/r 2 force, don’t we? 

This is where the second part of the brane-world idea comes in. The extra dimensions are 
curled up. A useful picture of this space is a soda straw with a line on it, as shown below 


The thick black line is intended to represent the brane on which we live - that is ordinary three 
dimensional space. Of course, the line is only one dimensional, but if we had more dimensions to 
draw in, we could include the other two. The important thing is that these dimensions are flat and 
infinite. We have only shown a segment of the infinitely long straw. The other dimension - around 
the straw - represents the extra dimensions. Again, there might be more of them. But the important 
thing is that these extra dimensions are curled up and finite. You have to remember when you stare 
at this that we have shown just one real and one extra dimension, to make it easy to visualize. But 
otherwise, it is a really helpful picture. 

Now suppose we put two masses on the brane - representing two masses in our three dimen¬ 
sional world. The situation is very different depending on whether the distance r between the 
masses is large or small compared to I), the diameter of the extra dimensions. 


\r\ 



For r <C D, the field lines spread out into the extra dimensions. On this small scale, the multi¬ 
dimensional surface of the soda straw looks approximately flat, and gravity looks n dimensional. 
The force falls off like 1 /r n_ 1 . 



field lines uniformly spread 
over extra dimensions 


For r D, the field lines spread out uniformly over extra the extra dimensions once you 
get a distance much greater than D from the masses. Then the force looks three dimensional, 
proportional to 1/r 2 . 

One of the reasons that this may be interesting is that it provides an explanation for the weak¬ 
ness of gravity. Gravity is week for masses on our brane world not because it is weak at short 
distances, but because it is diluted by being spread over the “large” extra dimensions. What “large” 
might mean is anybody’s guess. But it could certainly show up at the LHC if we are lucky. 
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lecture 24 


Topics: 

Disclaimer 

The Cosmological Principle and the Hubble expansion 
Newton’s theorem and the expansion of the Universe 
Hubble dynamics 
Dynamics of a flat universe 
Newton’s theorem and dark matter 

Disclaimer 

I want spend a few lectures on cosmology. This is not an area that I have worked in, so I do not 
know it in my bones the way I know particle physics. I will do my best to get across the big ideas, 
but we won’t have to go very deeply into things before my understanding starts to get spotty. I’ll 
try to remember to warn you when this is happening. I am certainly in no position to give the latest 
values of interesting quantities like age of the universe or the Hubble constant from a position of 
deep knowledge. When I do quote such a number, I will simply take it from the web page of an old 
friend of mine, Ned Wright, an astrophysicist at UCLA and a leader on the COBE Collaboration. 
Ned Wright’s Cosmology page is a fun site, and I recommend for poking around. What I am going 
to focus on is things that are directly connected to the stuff we have been talking about in this 
course. 

For the most part, we are going to focus of things that we can say without getting very rela¬ 
tivistic. The main reason for this is that the discussion can be much simpler when we can avoid 
confusing issues like relativity of simultanaity. To do things right would require general relativity 
(which we will not discuss, though we will take one result when we need it). And it would take so 
much time to get all the definitions straight that we wouldn’t have time left for the fun physics. 

At the end of the last lecture on cosmology, which will be the last lecture in the course, I will 
return to the question that we talked about at the beginning of the course. Why is Newton’s law 
a formula for acceleration, rather than something else. We will then be in a position to give a 
provisional answer to this question, or at least to relate it to another question, which in my view is 
one of the central mysteries of the universe. 

The Cosmological Principle and the Hubble expansion 

The “Cosmological Principle” is the assumption that when we average over sufficiently large 
scales, every place in the universe looks the same as every other place — there is no privileged 
position — no “center” of the universe; and that all directions are equivalent — the universe is 
isotropic. Ugh! This is Philosophy. And besides, I don’t believe it. One of the nice things that 
has happened in recent years is that we can now at least imagine how the cosmological principle 
might arise approximately from more physical principles. We will come back to this later, and also 
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discuss the experimental status of the cosmological principle. For now, as cosmologists have done 
for a long time, we will simply assume the cosmological principle to be true. 

When astronomers look at distant galaxies, they see them, on the average, moving away from 
us — receding. It is believed that the recession velocity is roughly proportional to the distance 
from us, at least until we go out far enough that the recession velocities are not small compared to 
the speed of light. This proportionality of recession velocity to distance is called the Hubble law. 

Let me give a very impressionist view of the observational evidence for this. The measurement 
of the recession velocity is not so difficult because astronomers can observe the Doppler shifts in 
spectral lines. Spectral lines are sharp peaks in the frequency spectrum of light (or other elec¬ 
tromagnetic radiation). They are associated with quantum mechanical jumps between particular 
quantum states of atoms or molecules, and appear on top of the continuum spectrum emitted by 
a hot object. The frequencies of these lines are determined just by quantum mechanics, so we 
know what they are in the rest frame. Here is a spectrum for a quasar (which is presumably just 
a distant galaxy doing strange things before its center settled down) spectrum, from a preprint in 
1996, W. Zheng, et. al, “A Composite HST Spectrum of Quasars,” arXiv:astro-ph/9608198. 



Figure 1: Composite FOS spectrum of 101 quasars, binned to 2°A. Prominent emission lines and 
the Lyman limit are labeled, and two possible emission features are marked. The continuum fitting 
windows are marked with the bars near the bottom. 

To lowest order in the recession velocity in units with c = 1, the fractional Doppler red-shift in 
the light that reaches us from a receding object is just 1 minus the component of velocity directly 
away from us. Here is a quick derivation. The diagrams show light coming from a distant galaxy 
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reaching us in two frames — our frame and the rest frame of the galaxy. 
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Also shown are the 4-momenta of an object on earth of mass m and an object in the galaxy of mass 
M} Using the invariance of the invariant product 
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7(1 + v \\. 

This is just what we want to know to verify the Hubble law. 

But the interpretation of the observations is not trivial because we do not have accurate ways of 
measuring the distances to far-away galaxies. For things that are nearby on an astronomical scale, 
we can measure the change in angle in the line of sight (compared to that for far away objects) 
as the earth moves around the sun, and use trigonometry to relate the distance of the object to the 
earth-sun distance (this is called parallax). Check out The ABC’s of Distances for adiscussion of 
parallax in Ned Wright’s Cosmology Tutorial. 

But for more distant objects, we can only infer their distance if they are or they contain objects 
whose intrinsic brightness we think we know. Such things are called “standard candles.” One 
of the keys to this business is to find good standard candles. If we see distant standard candles, 
and measure their observed brightness, we can calculate their distance by comparing the observed 
brightness with the intrinsic brightness. In principle this should work, but in practice, it is a much 
chancier proposition than using trigonometry. It will be accurate if we have a good theory of the 
intrinsic brightness and if the observed brightness is not affected by junk in the universe between 
the object and us. We will come back to this next week. 

Hubble’s original data (figure 2) was quite poor, but these days the Hubble law is known to 
work well out to distance of hundreds of megaparsecs, where a parsec is about 3.26 light years. 
Figure 3 shows some data using Type la Supernovas as a standard candle from Riess, Press and 
Kirshner (1996), quoted in Ned Wright’s Cosmology Tutorial 

Another reason that we believe the Hubble law is not experimental, but theoretical — it is the 
only possibility consistent with the cosmological principle. Let us see why this is so. We will do 


'The masses will cancel out, and it would be more elegant to use the 4-velocity 4-vector, which is the 4-momentum 
divided by the mass — that since that is one more definition to remember, and we don’t really need it, I have not spent 
much time discussing it. 
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Luminosity Distance [Mpc] 


Figure 2: Original Hubble data (1996) 



Figure 3: A Hubble plot using Type la SNe as a standard candle from Riess, Press and Kirshner 
(1996) 
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this assuming that the velocities are small compared to the speed of light — this will be valid in 
some relatively large region around the origin, as we will see. Then we can do the whole analysis at 
some fixed time t, and not worry about the changes in time that occur in Lorentz transformations. 

The way the proof works is that we begin by writing the law that relates velocity and position 
in a particular frame, defined by having the galaxy at the origin of the coordinate system at rest. 
We look at a galaxy at position r 0 and find that it is moving with velocity v 0 . Then we transform 
to a new frame in which this galaxy is at the origin and at rest, and we demand that the law has the 
same form. 

Here is the proof. Suppose that at some time t there is some law that specifies the velocity of 
each galaxy in terms of its position. This law can be expressed in terms of a function H(r) which 
gives the vector velocity v of the galaxy at position r in the inertial frame in which the galaxy at 
r = 0 is at rest: 

v = H{f) (3) 

Now suppose that we go an inertial frame in which the galaxy at r 0 is at rest and go to a coordinate 
system in which this galaxy is at the origin. In the original coordinate system, this galaxy has 
velocity v 0 = H(r 0 ). Thus in the new coordinate system, the positions and velocities are given by 

f' = r — f 0 v' = v — v 0 = H(f ) — 7?(r 0 ) (4) 

Now according to the cosmological principle, we must have the same law in the new coordinate 
system, so that 

v' = H{?) (5) 

Putting (4) and (5) together, you see that 7i must satisfy 

H(r-r 0 ) = H(r)-H(r 0 ) (6) 

But this means that the function H (r ) must be linear — each component of TL must be just a linear 
combination of the components of r. This may be obvious, but let me belabor the point a little by 
giving you a “proof.” Look at some component of H, say H x , and write (6) as 

H x {n - r 2 ) = H x {ri) - TL x {r 2 ) (7) 

Now differentiate both sides with respect to Xi and x 2 (or y or z). On the right hand side, this 
gives zero because the two terms depend either on x\ or x 2 but not both. The left hand side can be 
written entirely in terms of the derivative with respect to x\ using the chain rule: 

d 2 

H x {fi - r 2 ) = 0 => (8) 

OX\OX2 

d 2 

fa ~ r 2 ) = 0 (9) 

(9) is true for all r 2 , so we can set r 2 = 0 and rename fj —> r, so 

= 0 ( 10 ) 
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This means that 7 ~C x (r) is at most linear in x. The same argument works for y and z. But setting 
fi = r 2 in (7) gives Tt x ( 0) = 0, and so we can write 

'H-xij"') ^xx-K T bxyV T bxz^ (11) 

where the 6s are constants. This works for the other components as well, so our proof is complete. 

Now isotropy, the assumption that all directions are equivalent, implies that there is no special 
direction. This implies that Tt(r) behaves like a vector under rotation. The only function of r 
that is linear, behaves like a vector, and does not involve any other fixed vector (which would be 
a special direction) is r itself, possibly multiplied by a constant. The constant may depend on the 
time t, but cannot depend on anything else. Thus 

The cosmological principle =>■ v = H(t ) r (12) 


This is the Hubble Law! If we put ourselves at the origin (which doesn’t matter because all points 
are equivalent) it says that a galaxy at r is moving with a velocity that is in the f direction — 
that is directly away from us — and with velocity proportional to |f|. Notice that the function 
H{t) has units of inverse time. The value of II(t) today is called the Hubble constant, H 0 . It is 
conventionally given in the rather ridiculous units of kilometers per second per megaparsec. It is 
sometimes further expressed in terms of a dimensionless quantity, h, defined by 


H 0 = 100 h 


km/s 

Mpc 


(13) 


where h is measured to be 0.71 ± 0.04. This means, for example, that for a galaxy at a distance of 
100 Mpc, we expect a recession rate of 7.1 x 10 3 kilometers per second, that is about 2% of the 
speed of light. To convert these crazy units into an inverse time, we can convert all the distances to 
light years, and the time to years, using the fact that c ~ 3 x 10 5 km/s=l light year/year: 


H 0 = 


^100 h 



1 light year/year \ ( 1 \ h 1 

3 x 10 5 km/s J \10 6 x 3.26 light years/Mpc J 10 10 years 


(14) 


Which implies that 1/H 0 , which is called the Hubble time, is about 13.7 billion years. You can 
see on your PC what this looks l ik e (though of course you can imagine) in the Mathematica file 

hubble.nb. 

Equation (12) describes what is called an expansion of the universe (at least if H(t) is positive, 
which it is today) because the distances between all pairs of galaxies grow in exactly the same 
way. It is as if the space between the galaxies is expanding at the rate H(t). This is illustrated in 
the animation in hubble.nb. The Hubble time, 1 /H 0 , is the time it would take for the universe to 
expand by a significant factor (e) if the time dependence of H(t ) is ignored. The Hubble time is 
thus a kind of approximate age of the universe — the time it has taken for the size of the universe 
to change a lot. 2 This gives rise to the notion of the observable universe, which is an imaginary 

2 To get a really accurate age, we have to understand how H(t) depends on time, which we will discuss later. 
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sphere with us at the center and radius of about c/ H 0 — that is 13.7 billion light years. 3 This radius 
is just the distance that light has traveled since the universe was much smaller than it is today. This 
also gives rise to the notion of the Big Bang. If we run the expansion backwards, we get to a point 
where things start to get complicated! We will talk more about this later. 

Of course, the Hubble law, (12), and the cosmological principle from which it follows are is 
not really true. The universe is not really the same everywhere and the actual motions of galaxies 
are more complicated than (12). Galaxies come in clusters and superclusters in which they orbit 
around each others in complicated ways. But the idea is that this complicated local behavior is 
superimposed on the simple Hubble expansion. Below is a very very rough guide to the sizes of 
things: 


The scales of the observable Universe 

~ 0.01 Mpc 

visible size of large galaxies 

~ 0.1 Mpc 

dark size of large galaxies 

~ 1 Mpc 

distance between galaxies 

~ 10 Mpc 

big clusters bound by gravity 

~ 100 Mpc 

largest regions of enhanced density 

l cosmological principle applies in this region???? J 

~ 10 4 Mpc 

~ c/Hq - the observable universe 


Newton’s theorem and the expansion of the Universe 

I’m sure you have all heard that general relativity implies that the universe might be curved, and 
might be finite, rather like the two dimensional space on the surface of a balloon. This would 
make it difficult for us to analyze it in purely Newtonian terms. Also, as we know only too well, 
Newton’s picture of gravity cannot be exactly right because it does not account for special relativ¬ 
ity. Nevertheless, at least if the universe is made of ordinary matter (an assumption that we will 
question in later lectures) we can use Newtonian gravity to leam something about the evolution 
of H(t). We can do this because of the cosmological principle. Since the universe is (assumed to 
be) the same everywhere, we can analyze the evolution of H(t ) in a sufficiently small region that 
the velocities never get relativistic, and the curvature of space is not important. Then we can use 
Newtonian mechanics and Newtonian gravity to discuss not just the velocities of the galaxies, but 

3 mmm 
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the evolution of their velocities. Let’s begin by adopting a coordinate system with us fixed at the 
center, and asking about the motion of a distant galaxy at r. 

This subject contains some subtleties. We will start with a deceptively simple analysis, and 
then go back and think about it. Consider the contribution to the forces on us and on the distant 
galaxy from a spherical shell of mass (galaxies) centered at the origin. Such a shell never makes 
any contribution to a force on us because of Newton’s theorem. So there is no force on us and we 
remain fixed at the origin of the coordinate system. However, the distant galaxy feels a force from 
every shell with radius less than r = \r |, pulling it towards us. The total force on the galaxy is 
towards us with magnitude 


47rr 3 Gm p 47t 

= — Gm pr 


(15) 


3 r 2 3 

where m is the mass of the galaxy and p is the mass density of the universe at the time. 

Now what does this mean? The obvious problem with the analysis is that it seems to depend on 
choosing the center of the spheres to be where we are. Why not choose the center some place else. 
If we do that, the force on us will not be zero, so how can we say that we are in an inertial frame? 
It is true that if we do this the galaxies are moving in some complicated way through our spheres, 
but that should make no difference to Newton’s theorem because in Newtonian mechanics, the 
gravitational force depends only on the instantaneous positions of the various masses. 

One problem here is one of infinity. If you think about the force on any galaxy in the universe 
from all of the others, in Newton’s theory, the answer is simply undefined. Replacing the sum over 
galaxies by an integral over the position of the galaxies (which should be a good approximation 
for distances much larger than the typical distances between galaxies, or at least clusters) it is 


F = Gm p 


Sr' 



(16) 


This integral is not well defined because it gets contributions from regions of the universe that are 
infinitely far away. It depends on exactly how we take the limit that extends the integral over all 
space. If we consider (16) as the limit of an integral over a large sphere as the radius of the sphere 
goes to infinity, then this reduces to the previous discussion, and as we have seen, the answer 
depends on where we choose the center of the spheres. In general, the limit depends on some 
complicated way on how one takes the volume to infinity. This is not useful. So what is actually 
going on? 

First note that because of the equivalence principle, we can think about gravitational accelera¬ 
tions, rather than forces — this is nice because the mass of the galaxy cancels out. 

/ (V» f _ /p 

Sr' — -— (17) 

r — r r 


However, this doesn’t help with the infinite volume problem — we don’t know the acceleration of 
a given galaxy any more than we know the force on a given galaxy. 

But fortunately, we are not really interested in the acceleration of any particular galaxy. What 
we mean by a coordinate system with us fixed at the center is that the coordinate system is acceler¬ 
ated by the local gravitational field at our position. Thus the situation is the same here as with the 
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tidal force. The actual acceleration that we care about on a distant galaxy is the difference between 
the gravitational acceleration on the galaxy and the gravitational acceleration on us. 

/ rp ? _ p p ^ 

dV—- z^-Gp / d'V — -—- (18) 

\r — r \ 6 J \r — Tq\ 6 

In general relativity, we would simply say that we can go to a locally inertial frame in which there 
is no gravitational field where we are. 

While the gravitational acceleration on a particular galaxy is not well defined, the difference 
in the acceleration on any two galaxies is much better defined. You still have to be careful about 
how you take the limit of infinite volume. But unless you do something dumb the difference makes 
sense. For example, it is easy to see that if we define the limit in (18) by taking a large sphere to 
infinity, the difference is independent of where we take the center of the sphere. 

Since we computed (15 ) by adding up the contributions of spheres centered on our position, 
the acceleration of gravity on us computed in this way vanishes. Thus when we compute the 
acceleration on the distance galaxy in the same way, we are actually computing the difference 
between the acceleration of the galaxy and that of our own, which is what we want. Likewise, if 
we compute the difference, then it will not matter where we put the center of our spheres — we 
will always get (15 ). So, understood properly, the simple result is correct, though the argument is 
inadequate! 


Hubble dynamics 


We can now follow the distant galaxy as it moves with the expansion of the universe. The acceler¬ 
ation is 

r = -ryGp(t) (19) 

(19) is consistent with the Hubble law in the following sense. If at some time, the Hubble law, (3 ), 
is satisfied, the dynamics implied by (19) implies that it will be satisfied at subsequent times. To 
see this explicitly, note that (19 ) can be written as 

±v = -rf G p(t) (20) 


Thus the rate of change of v is proportional to r, so if v is proportional to r at some time, it stays 
that way. 

To say more about the evolution of the expanding universe, it is convenient to rewrite (19) as 
follows: 

•v ^4vr _^GM 

r — —r — G p = —r —— (21) 

3 r 6 


where 

„, 47rpr 3 

M = —— 
3 


( 22 ) 


is the mass inside an imaginary sphere of radius r = |r|. That is M is the sum of the masses of all 
the stuff that is closer to us than the galaxy of interest. The advantage of writing things in terms 
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of M is that M doesn’t change as r changes. The size of the imaginary sphere changes as the 
galaxy we are looking at moves farther way, but on the average, because all parts of the universe 
are expanding at the same rate, stuff doesn’t cross the surface of the imaginary sphere. So M 
remains constant. This is illustrated in the Mathematica file hubble.nb it you set sphere = 1. 

Taking the dot product of both sides of ( 21) by r, we can write the left hand side as 


^ -v d 1 
dt 2 


( r 7 


(23) 


and the right hand side as 


^ _^GM u f G M 

'f . f - — — f . — - — f 


dr d G M 


ry* j’2 ry* 


u 8 GM 


dr r 


or 


which implies that 


d 1/^2 _ d GM 
dt 2 ' ' dt r 



GM 

-= constant 

r 


1 

2 


C 


or 



4vrGp 2 

- r 

3 V ; 


1 

2 


C 


d GM 
dt r 


(24) 

(25) 

(26) 
(27) 


The constant C in (26) and (27) depends on the initial conditions. There is a theoretical prej¬ 
udice today in favor of C = 0, for reasons that I will try to explain later . 4 This means that the 
universe lives in a flat space, which makes it somewhat easier to understand. However, it is still 
possible that 67 7 ^ 0, in which case general relativity implies that the universe is curved. We won’t 
talk about this in detail, except to note that if 67 is negative, the universe is finite and positively 
curved, like the three dimensional analog of the surface of a sphere, while if 67 is positive, the uni¬ 
verse is negatively curved and infinite. We will assume that C — 0, and just briefly discuss what 
happens in the other cases. If we ignore relativity, the physical significance of C can be easily seen 
from (26). If 67 > 0, the expansion of the universe will go on forever, and as the distance to the 
distant galaxy r goes to 00 , the speed of the galaxy will go to a finite nonzero limit. If 67 < 0, 
there will be a largest value of r at which the galaxy stops, and gravitational attraction turns the ex¬ 
pansion of the universe into contraction. If 67 = 0, the balance of kinetic energy and gravitational 
attraction is such that the speed of the distant galaxy goes to zero as r —> 00 , but it never stops and 
turns around. 


Dynamics of a flat universe 

Using the Hubble law, (3), in (27) gives 


1 

2 


H(tf (f ) 2 


AnGp 2 

- r 

3 V ’ 



4 In any event, there is a sense in which the universe is remarkably close to C = 0. 
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Taking C = 0 in (28), dividing both sides by r 2 and putting in the time dependence of pit) 
explicitly, we find 

8nG p(t) 


The density implied by (29), 


H(ty = 


Pc(t) = 


3 Hjtf 
87 xG 


(29) 


(30) 


is called the critical density. Because both p(t) and II{t) are measurable, at least in principle, (29) 
is a test of the flatness of the universe (assuming that the rest of our assumptions are correct). If the 
measured density is greater than critical, p(t) > p c (t), then C in (27) is negative, and the universe 
is closed and positively curved. If the measured density is less than critical, p(t) < p c (t), then C 
in (27) is positive, and the universe is infinite and negatively curved. 

The matter that we can actually see in the universe contributes only a very small fraction of the 
critical density. However, this doesn’t mean that the universe is not flat. 


Newton’s theorem and dark matter 

There is a lot of evidence that galaxies contain much more matter than we can see. What we see 
we see because it emits light — it is called luminous matter — mostly stars or gas clouds. The 
most convincing evidence that there is a lot of other stuff comes from studies of the orbital velocity 
of gas far from the galactic centers, beyond almost all of the visible matter. What one sees is that 
the speed at which things orbits around the center of the galaxy remains roughly constant with 
distance far beyond the radius at which you see significant visible matter. Newton says that 
for a circular orbit at radius r in a spherically symmetric collection of mass, 


v 

r 


GM(r) 


(31) 


where M(r) is the mass inside a sphere of radius r. Thus one simple way of explaining the 
constancy of the orbital velocity is to assume that there is a large spherically symmetric halo of 
dark matter around each galaxy with a density that goes approximately like 1/r 2 , where r is the 
radius. Then the amount of matter contained in a sphere of radius r is 


M{r ) = / 47T r' 2 dr' p{r') oc / dr' oc r 
Jo Jo 


(32) 


which cancels a 1/r on the right hand side of (31), so that the rotation speed v remains constant. 
This much of the subject you should understand. 

Whatever it is that contributes this mass density proportional to 1/r 2 (except for the stars and 
gas that we see) is called dark matter. So what is this stuff? Is it real? Is it matter that we know 
about or something else? 

Recently, another interesting way of looking at the effect of dark matter has emerged. The 
rotation speed arguments we have just talked about involve the effect of the dark matter’s gravity on 
ordinary matter. This has more-or-less convincingly established that there is dark matter associated 
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with the luminous matter in galaxies. There are also indications that there is more dark matter 
associated with clusters of galaxies. Weird as it sounds, one can also see the effect of dark matter’s 
gravity on light. The idea here is to use so-called “gravitational lensing.” Because light can be 
bent by a gravitational field, it is possible that a massive cluster of galaxies could bend the light 
from an even more distant galaxy (or quasar) so that we on earth would see the light coming at 
us from several directions. 5 This has actually been seen, first in radio-telescope observations, and 
more recently and spectacularly in Hubble Space Telescope images and other visual data. Figure 4 
shows one such image. In this Hubble photo, around the bright oval galaxies of a distant cluster, 



Figure 4: Hubble images showing gravitational lensing. 

we see the wispy arcs of a still more distant object, focused and distorted by the gravitational lens 
produced by the dark matter of the cluster. One can use this phenomenon to study the distribution 
of the dark matter. Figure 5 dramatically shows the result of such a study of another cluster of 
galaxies - a plot of the matter density in a cluster, showing high peaks at the visible galaxies, but 
an enormous background of dark matter between them. 6 

If the dark matter really is some kind of matter, there are indirect but fairly convincing argu¬ 
ments that it cannot be made out of normal stuff. The evidence comes from the fact that we have a 
picture of how light nuclei are formed in the very early universe that works reasonably well, and a 
lot more normal matter would mess up this nice picture (of what is called “primordial nucleosyn- 

5 or even, if the geometry is just right, from an “Einstein ring.” 

6 from a Bell-Labs new release in 1997. 
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Figure 5: Mass distribution in a cluster of galaxies showing dark matter bakground. 

thesis” — for example, if there were enough baryons to account for all the dark matter, the theory 
predicts that there would be much less deuterium in the universe than we actually see!). So most 
people assume that it is some kind of new particle! There are many ongoing experimental efforts 
to see dark matter particles as the earth moves through the dark-matter halo of our galaxie. At 
least one experimental group believes that that they have seen something, but the data so far is not 
convincing, nor does it tell us much about what this stuff might be. 

I am not going to talk about all this in detail. Here is a link to reviews on the web: 
astron.berkeley.edu/~mwhite/darkmatter/dm.html — A brief but up-to-date introduction with 
additional links. 
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lecture 24 

Topics: 

Where are we? 

Running Hubble backwards 
Relativistic cosmology 
Back to the big bang 
Thermal equilibrium 
The hot bang and the CMBR 
Temperature and phases 

Where are we? 

After our introduction to the Hubble expansion in the last lecture, we are now going to work our 
way back towards the beginning of the universe. This is sort of fun, for its own sake, but it will 
also, I claim, get us closer to an answer to the question we asked at the very beginning of the course 
— why is Newton’s third law a formula for acceleration. 


Running Hubble backwards 


Let us start by going back to the laws we derived to describe the evolution of the Hubble expansion. 
If we put ourselves at the origin and take r to be the position of some distant galaxy, the Hubble 
law can be written as 

r — H(t) f (1) 

where H(t) is the Hubble parameter. Then, assuming that we can use Newton’s theory of gravity 
and ignore relativity, we found that gravity slows the Hubble expansion 

f=-ryGp(t) (2) 

Then by considering the conservation of the mass inside a sphere with radius \r | (still ignoring 
relativity), we found that we could integrate (2) to obtain 


1 2 4vr Gp{ff 

^(v) --- 



(3) 


where the constant C depends on the initial conditions. Again, this derivation was valid for a 
universe dominated by slowly moving matter, so long as the cosmological principle is satisfied. 

It is conventional and convenient to get rid of the vectors in ( 1 )-( 3 ). We can do this by defining 
an (arbitrary) distance scale a between points in the universe. For example, we could take a = \r |, 
the distance between us and the distant galaxy. But the notion is more general, as it has to be 
because we want to use it to describe the universe at earlier times, before we or even galaxies were 
around, a is just an arbitrary scale that measures the relative size of the universe. The value of a 
doesn’t matter at all, but if a doubles, that means that the distance between (far apart) things in the 
universe has doubled! 

Using a = \f | and that fact that the direction of f doesn’t change with time, we can write 


r — ar 


r — ar 


r = ar 


(4) 


1 




(5) 


and rewrite (1 )-(3 ) as 


a 

a 


4 = Hit) 
a 

47r 

= ~Gp{t) 


( 6 ) 


and 


- 2 =h 2 = 

a 2 


8tt G p C 


3-2 

a 2 


(7) 


Now for simplicity, let us assume that C — 0. As I have said before, there are theoretical 
reasons (that we will discuss next time) to think that this is a good approximation, and some ob¬ 
servational support for the detailed picture that emerges. We can then follow the Hubble evolution 
backwards towards the big bang, at least for a while. 

For C = 0, we can take the square root of (7) and write 


a 8ir G p 
a ~ V 3 


Note that have taken the positive sign, corresponding to expansion. 

To determine the time evolution of a, we need a relation between a and p. This relation depends 
on what kind of stuff our universe is made of at the time of interest. For slowly moving matter, 
we can simply use the fact pa 3 is a constant, because the density is inversely proportional to the 
volume. Thus 


0 = y (pa 3 ) = pa 3 + 3pa 2 a 

CLL 


a 

a 


1 P 
3 p 


or p = —3 pa/a 


(9) 


Thus we can write (8) as 

This is a differential equation that we can solve for p (or course, the result is only meaningful as 
long as our assumptions are valid): 


p = — V247T G p 3 / 2 


( 11 ) 


P 


1/2 


- 1/2 


dp 

W 2 


— V24n G J dt = — V247T G (t — t 0 ) 


( 12 ) 


p 1//2 = \/Qti G (t — t 0 ) 
1 

^ 6tt G (t — t 0 ) 2 


(13) 

(14) 


where t 0 is a constant set by initial conditions. As expected, the density decreases as the universe 
expands and increases as we run the tape backwards. Since pa 3 remains constant as t increases, 
(14) implies that a grows as 


a oc (t — t 0 ) 2/3 


(15) 


2 













So if this were the whole story, the expansion of the universe would look something like this 



To recapitulate, two separate pieces of physics are required to understand how the scale a evolves 
with time. One is the is the relation (7 ) (or (8 ) for C = 0) that describes the effect of gravity on the 
expansion. The other is the relation (9) that describes how the density p changes with changes in 
the scale factor a. A third relation, (6 ), is not independent of these two. We can derive it from (7) 
and (9). One way to do this is to multiply (7) by a 2 and differentiate (going backwards through the 
steps we used to get (7)) to get 


2 da 


8nG 

~Y~ 


(pa 2 + 2 pad) 


87 tG 

~Y~ 


(—3 pad + 2 pad) 


Dividing by 2 ad then gives ( 6 ). 
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( 16 ) 


Relativistic cosmology 

The simple behavior implied by (14) and (15) is not consistent with the universe we actually 
observe. A couple of important things have been left out. One of these has been known for a long 
time. There is good evidence that the big bang was hot as well as dense. Heat complicates matters 
in a couple of ways. Heat is the random motion of the constituents of matter. The higher the 
temperature, the faster the motion. But once stuff is moving around, we have not only density, but 
also pressure — we left pressure out of our simple analysis of the Hubble dynamics. Furthermore, 
as we go back farther in time, the temperature gets higher and higher, and eventually, the random 
motion of the particles approaches the speed of light. Then we cannot ignore relativity! In this 
regime, as we know from our study of relativity, all sorts of bizarre things happen. Particles are 
created and destroyed. Energy and momentum are conserved, but not mass. We must be careful. 

A complete understanding of what happens when things get relativistic requires that we gener¬ 
alize Newton’s theory of gravity to incorporate special relativity. The resulting theory is Einstein’s 
general relativity. We are not going to discuss this here, except for one result, which will be all we 
need. It turns out that (7 ) remains correct in general relativity if p is interpreted not as the mass 
density, but as the relativistic energy density divided by c 2 , or just the energy density in sensible 
units in which c = 1. This makes sense in that if we go back to low temperatures and things come 
to rest, the energy density just becomes the mass density. But in the relativistic regime, it just turns 
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out that gravity affects all forms of energy, not just mass (that is rest energy), so we have to use the 
energy density instead in (7). It is worth writing (7) again, now that we know how general it is: 


~=H Z = 

a 2 


8n G p C 

-^-1-o 


(17) 


This is called the Friedmann equation. The parameter C here is related to the curvature of the 
space of the universe. C = 0 corresponds to flat space. 

Let’s see what becomes of (9) and (6) in the presence of pressure and relativity. First consider 
(9). To think about how this changes, let’s consider the total energy in a cube with side a, and see 
how it changes with time in an expanding universe. Remember that a here is some distance that 
changes along with the space of the universe. This cube is like the sphere in hubble.nb except 
that now everything is bouncing around randomly and bumping into everything. On the average, 
no stuff comes into or goes out of the cube, or at least as much stuff comes in as goes out. We 
could think of it as having impermeable sides or pistons on all sides or something like that, so 
that collisions with the sides take the place of particles going in and out of the cube. This isn’t 
quite right for any particular collision, but it should work for averages. And since temperature 
and pressure are the result of random processes, averaging over a very large number of individual 
processes, this is really what we want. Because we are now intepreting p as the energy density, the 
energy is pa 3 . Now in this picture with the impermeable sides, because of the pressure, the energy 
changes with the volume — the pressure p does work on the boundary, which reduces the energy. 

Let’s analyze this quantitatively in two ways - first for the cube, and then for a region of 
arbitrary shape. Both will give the same result. 

It is easiest to see what happens for the cube if we put one comer at the origin, so this corner 
doesn’t move with the expansion. Then, as shown in the figure below, when a —> a + da the three 
sides that do not touch the origin each move out a distance da. The area of each face is a 2 (it 
doesn’t matter whether we use a 2 or (a + da) 2 or something in between because the difference is 
infinitesimal). Thus the force on the face is p a 2 and the work done on each face is p a 2 da. Because 
there are three faces, the energy lost is 3 pa 2 da, which is the change in energy in the cube. Then 
because 3a 2 da = d(a 3 ), we can write 


d(p a 3 ) = —3p a 2 da = —p d(a 3 ) (18) 



Now let us do the analysis for a region of arbitrary shape. The work done by the pressure in an 
infinitesimal change of the boundary is a surface integral over the boundary of the force dF dotted 
into the change in the position of the boundary, db. The force of pressure has magnitude equal to 
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the area dA times pressure p, and its direction is normal to the boundary. Thus the work done is 
the pressure times the area times the perpendicular motion of boundary, which is equal to p times 
the change in the volume — 


dE = f dF ■ db = [ pdAdb±=pdV (19) 

Ja Ja 

Thus here because V = a 3 , (19) implies 


which agrees with (18). 
(20 ) implies 




p a 3 + 3 pa a 2 = —3 p a a 2 


( 20 ) 


( 21 ) 


or 

p=-3 a (p + p ) (22) 

a 

Notice that this reduces to (9) when the pressure vanishes, as it should. 

Using (22), we can find the appropriate generalization of ( 6 ) by following the steps we used in 
(16 ). Multiply (17 ) by a 2 , and differentiate with respect to time. 


d . 2 d 8n G pa 2 

dt dt 3 


(23) 
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(—3(p + p) da + 2 pa a) 


(24) 


Dividing both sides by 2ad gives 

a AnG. 

- = -P + 3p (25) 

a 3 

This is the relativistic generalization of ( 6 ), to which it reduces when p — 0 and p is the rest energy. 
Note that it is not at all obvious why it is (7 ) that does not change when we go to the relativistic 
limit, while ( 6 ) gets generalized to (25 ). This requires knowing that (7), with p reinterpreted at the 
energy density, is the correct general relativistic result. 


Back to the big bang 

Armed with the results of the previous section, we can go back a bit further into the history of 
the universe. But now that we have included the effect of pressure, we also have to understand 
the relation between the energy density p and the pressure p for relativistic stuff. An important 
example of relativistic stuff is a gas of photons — particles of light — radiation. We can compute 
the pressure in any shape container, so consider the pressure of a gas of photons with energy density 
p in a cubical container of side a. The pressure of such a gas arises because the photons bounce 
off the sides of the container. Suppose the sides are lined up with the coordinate axes. A photon 
with energy E and momentum p, if all the components of p are nonzero, bounces around off all the 
sides. Each time it hits a side, the component of momentum perpendicular to the side changes sign, 
but the others remain unchanged. Consider the force on the sides perpendicular to x. When the 
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photon bounces off this side, it imparts an impulse 2\p x \. The time it takes to get back to the same 
side, because it must go a distance 2a in the x direction, is 2a/\v x \, where v x is the x component 
of the velocity, which is p x /E. The contribution of this photon to the impulse per unit time, which 
is the force, is thus 


2|Px| E 2 

-= — V 

2a/\v x \ a x 


(26) 


The contribution to the pressure is the force (26) divided by the area a 2 . Then we get the total 
pressure by summing over all the photons 


p = E*»! < 27 ) 

GT 

But the pressure is the same on each side, so because the velocity of each photon is 1, the result 
must be 



The pressure of a relativistic gas is 1/3 the energy density (in relativistic units, with c = 1, of 
course). 

Now let us apply (28) to understand how the energy density of a gas of photons changes with 
the Hubble expansion. Forp = p/3, (22) becomes 

p=-4-p (29) 

a 

This implies that p a 4 is constant. Thus the energy density of a relativistic gas falls like a -4 as 
the universe expands, faster than the energy density of nonrelativistic matter, which falls like a -3 . 
It may help to understand this result to realize that this means that the energies of the individual 
relativistic particles must be falling like 1/a, because their number density clearly falls like a -3 . In 
the expanding box derivation above, the energies fall because they lose energy in the their collisions 
with the sides of the box. In the universe, there isn’t any box, but if the relativistic particles are 
in thermal equilibrium, they lose energy bouncing off the other particles around them which are 
expanding away from them (just like the sides of the box). 

An interesting thing about this is that the relation (29) remains true even when the density of 
the relativistic particles becomes so low that they no longer bounce off one another very often. But 
even in that case, (29 ) is satisfied in an expanding universe. What is happening is the that energies 
are effectively red-shifted down as the universe expands. The larger the universe is, the farther 
away the relativistic particles reaching some particular point are coming from. But the farther 
away they came from, the more red-shifted they are (because of the Hubble expansion). If the 
particles have mass, the process eventually stops when the particles become nonrelativistic. But 
for photons, and other massless particles, it continues forever and (29) remains true as the density 
goes to zero. 

Furthermore, going in the other direction, this means that if we follow the history of a rela¬ 
tivistic gas back in time, it not only gets more dense as we go back, but because the energies of 
the individual particles are increasing, the temperature increases as well. It turns out (as we will 
see below) that the average energy is proportional to the temperature, so the temperature of the 
relativistic gas goes like 1/a. 
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Thermal equilibrium 


There is one more component to the now standard picture of the hot big bang. One assumes that 
at some time in the early history of the universe, all of the particles were in thermal equilibrium 
at an enormously high temperature. What thermal equilibrium means is that the particles collide 
frequently enough that their motions are thoroughly randomized. You might think that this ran¬ 
domness would make it hard to understand how such a hot universe works. But in fact, exactly 
the opposite is true. In thermal equilibrium, all the important properties are determined on the 
average by a single parameter — the temperature. You can them follow the subsequent evolution 
of the universe, at least on the average, using the tools we have developed above. This seemingly 
paradoxical situation is beautifully explained in one of the best popular science books I know of, 
Steven Weinberg’s The First Three Minutes, which I recommend for any of you who have not 
already read it. You will also learn (much) more about the connection between temperature and 
randomness if you take Physics 181. But it is so beautiful that I want to take time out from our 
study of the universe and talk just a little bit about thermal equilibrium and randomness. 

A beautiful area of physics called “statistical mechanics” is the study of how random motion 
of particles is related to classical notions like heat and temperature. “Thermal equilibrium” is one 
of the basic concepts here. The idea is that randomness allows us to use probalistic arguments to 
understand the physics. What makes this powerful is that the number of particles is very large. 

I will give you one example of the power of this approach — the Boltzmann distribution. 
Suppose that a system with a fixed, very large number N of degrees of freedom is in thermal 
equilibrium. The degrees of freedom may be the components of the position of single particles, 
or they may include additional coordinates like the orientation of molecules. It doesn’t matter 
what they are. But we will assume that these degrees of freedom are essentially free between 
collisions, but that collisions happen frequently and randomly, so that each degree of freedom that 
is in thermal equilibrium is thoroughly randomized with all the others. 

The coordinate for each degree of freedom Q 3 has a corresponding generalized momentum V t . 
The only important facts that goes into the Boltzmann distributution are the thermal randomness 
that makes each degree of freedom the same on the average, the fact that N is very large, and the 
fact that there is a conserved energy that is quadratic in each of the momenta. It is easiest to see 
what is going on if we choose the normalizations of the momenta so that the energy is just the sum 
of the squares, 

N 

Etot = E V J (30) 

3 = 1 


For example, for an ordinary space momentum, this just means absorbing a factor of 1 /\J 2 m so 


that 



(31) 


This just makes the derivation easier because we don’t have to carry around the extra factors. 

Now the statement of the Boltzmann distribution is that the probability of finding a degree 
of freedom with momentum between V and V + dV is completely calculable in terms of the 
temperature — 


B(V) dV 


dP v 2 

, - e 2B >v 

V27T E av 


dP v 2 
— / 6 kT 

a/ irkT 


(32) 
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The function 


1 


1 


(33) 


B(V) = 


\j2n E a 


y/nkT 


is the Boltzmann distribution, where E av is the average energy per degree of freedom which is 
related to the temperature by Boltzmann’s constant, 


k = Boltzmann constant = 1.38 x 10“ 23 J/°K = 8.62 x 10“ 5 eV/°K (34) 


according to 

kT 

E av = — (35) 

It is easy to understand the exponential factor in (33). The key to the probability argument is 
simply conservation of energy, (30). Because of (30), any particular degree of freedom can have a 
generalized momentum anywhere between —"P max and V mnx where 

T^max — \j E tot (36) 

But if one degree of freedom has generalized momentum V\, then the maximum possible value of 
all the others is reduced to 

PL* = EEEvi = - P? ( 37 ) 


This is the basic physics. It is unlikely to find one degree of freedom with a very large generalized 
momentum V\ because conservation of energy would then restrict the possible momentum values 
for the very large number of other degrees of freedom. Because the possible range of each of the 
other degrees of freedom is reduced by a factor of 

= V 1 - PiUPL* ( 38 ) 

' max 


the probability of finding a value V\ for one degree of freedom should contain a factor of 

'VLA N ~ 1 /_ ,„o \(N-l)/2 


V n 


(l - vl/v\ 


2 

max 


(i - pI/pU) 


\N/2 


(39) 


The limit that we are interested is N very large with the total energy growing linearly with N so 
that the average energy per particle is fixed. 


T>L* = E tot = NE : 


(40) 


so (39) becomes 

v\ \ N/2 

NE av ) 


(41) 


In the limit of very large N, the exponential factor (41) overwhelms any other dependence on the 
particular degree of freedom and gives the exponential in the Boltzmann distribution: 

( V 2 \ N/2 

lim 1 — — = e = e kT (42) 

n^oo y N E av ) 
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The rest is just a normalization factor to make the total probability equal to one. You can see from 
this argument that the factor of 1/2 in the relation between E av and kT, (35 ), comes from the fact 
that the energy is a quadratic function of V. 

This is illustrated in the Mathematica worksheet boltzmann-dynamic.nb. Here we start with 
a uniform random distribution of the momenta between —1 and 1 (in some arbitrary units) of 5000 
particles moving in 2 dimensions. The momenta are then randomized by a series of collisions, each 
conserving energy and momentum. Particle 1 collides with 2, then 2 with 3 and so on to 1000. The 
whole process is then repeated 10 times and you can watch as the momenta get randomized. You 
can see dramatically how the initial uniform distribution gets converted in a Boltzmann distribution 
by the power of probability. 

The hot bang and the CMBR 

Let’s assume that the hot big bang picture is correct, and think about what the universe looked 
like when it was very hot. First of all, normal matter, made of neutral atoms certainly didn’t exist, 
because the high energy collisions would have completely ionized all the atoms. The universe 
would be a plasma. Furthermore, since particle number is not conserved in relativistic collisions, 
particles and their antiparticles can be produced and destroyed. So for example, while it seems 
that in the universe today, there are a lot more electrons than there were positrons, long ago when 
the universe was less than a millionth its current size, there were almost as many positrons as 
electrons. The small excess of electrons that eventually became the electrons in our atoms was 
quite unimportant at early times. For the same reason, at even earlier times, there was a lot of other 
stuff around in the early moments of the universe that we don’t see much of today — heavy unstable 
particles which today we can make only at large accelerator laboratories and which quickly decay 
back into ordinary stuff were as common in the very early universe as electrons. These heavy 
particles disappear when the universe cools to a temperature such that the typical particle energy is 
below their mass. It is all quite strange — but simple, in a funny way, because everything is more 
or less fixed just by the temperature. 

Now why would anyone believe this? We cannot, after all, go back and do experiments on the 
early universe. Why is this discussion science? The answer is that we can almost see it! At least we 
can look back toward the beginning of the universe by looking far away in the universe, because the 
light from far away regions of the universe has taken a long time to get to us. But we can’t look back 
all the way. Once the universe gets so hot that atoms dissociate into ions, photons cannot go very 
far without colliding with electrons — the universe becomes opaque. Thinking about this in the 
other direction is even more interesting. As the universe cools to below the temperature at which 
atoms dissociate (a few thousand degrees C), it becomes transparent to photons, which means 
that photons fall out of thermal equilibrium. From then on, most of the photons just move freely, 
never colliding with anything again. This “gas” of photons continues to behave like relativistic 
stuff, while the atoms are nonrelativistic. Thus as the universe continues to expand, the energy 
density in the photons gets less and less important to the overall Hubble evolution, but the photons 
are still there, getting more and more red shifted as time goes on. This gas of photons from the 
formation of atoms, a few hundred thousand years after the big bang, is the Cosmic Microwave 
Background Radiation (CMBR). A tiny fraction of these photons hit the earth and can be detected. 
Much of what we actually know about the early universe comes from studies of the CMBR. The 
first obvious thing to do is to measure the temperature, which turns out to be about 2.7°C, which is 
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about 1000 times smaller than the temperature at which atoms come apart into ions. This means, 
since photon temperature and energy is inversely proportional to a, means that the universe today 
is about 1000 times bigger today than it was when atoms first formed. There is actually much more 
to this statement than meets the eye. It is an important prediction of the hot big bang model that 
CMBR looks like it has a temperature at all. The reason it does, even though the photons are no 
longer colliding very much, and are not in thermal equilibrium, is that the random distribution of 
photon energies that was present when the universe first became transparent is still there — just all 
the energies have been scaled down. This prediction has been confirmed by looking at the CMBR 
in many different regions of photon energy, and checking that the distribution of energies is what 
one would expect in a thermal distribution. 

We cannot directly see the universe at scales smaller than 1/1000 the current scale. However, 
we can use the tools we have discussed to follow the universe back to small sizes and higher 
temperatures and energies. We can go back about a factor of a trillion (10 12 ) before we get to 
such high energies that have not directly seen the physics in laboratory experiments, so we are 
reasonably confident that we have the picture right back that far. And there are some observable 
consequences. For example, most of the nuclei of light elements (deuterium, helium, etc) were 
formed back when the universe was a million times smaller and protons and neutrons first started to 
stick together. The hot big bang picture predicts a specific pattern of abundances of these elements, 
which can be checked by looking for them out in the universe. Tins is much less direct and more 
problematic than direct observation, because one worries about what has happened to these nuclei 
in the 10 billion years since they were formed. Nevertheless, there is interesting information to be 
had in this way. 1 

Another area of recent progress comes from the observation of small but regular temperature 
differences in the CMBR. These are associated with a funny kind of driven oscillation in the early 
universe. I think that what is going on can be described as follows. 2 The “oscillator” is the electron- 
proton-photon plasma. Before “recombination” when the electrons and protons (or heavier nuclei) 
form atoms, the electrons, protons and photons are tightly coupled together by collisions and form 
a gas which like air has a pressure (provided by the photons) and inertia (provided mostly by 
the photons but a little by the mass density the protons), and therefore a speed of sound. The 
“driver” is the primordial density fluctuations and gravitational instability of the dark matter. These 
fluctuations produce waves of density fluctuations in the plasma that propagate outward from a 
region of enhanced dark matter density like ripples on the surface of a pond disturbed by a rock. 

This wave propagation stops when the plasma recombines into atoms, gets frozen into the pattern 
of the CMBR on the sky, showing up as a wave in the temperature of the CMBR photons. These 
are known as acoustic peaks (check out the discussion and movie in this link). They are seen by 

'Again, see Steven Weinberg’s The First Three Minutes for more details. 

2 For a more detailed, but understandable description, see background.uchicago.edu/~whu/intermediate/intermediate.html 
on the web or the Scientific American article by Hu and White, background.uchicago.edu/~whu/Papers/HuWhi04.pdf. 
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instruments like the WMAP sattelite 3 



Angular Scale [Degrees] 

100 20 5 2 1 0.5 0.2 



Eventually, we get back to such high temperatures that we really don’t know how the physics 
works. But we suspect that one thing that shows up is interesting structure in the vacuum. We’ll 
talk more about this in a moment. 

Here is a web link to more information about the Cosmic Microwave Background: 
http://background.uchicago.edu/ 

This is a very nice site maintained by a faculty member at Chicago. 

Temperature and phases 

The plasma that appears at high temperature in the early history of the big bang is an example 
of a “phase” of matter The idea of “phase” is one of the most familiar concepts in the physics of 
matter. You know from childhood about the three phases of H 2 0 - water-ice-steam. And you know 
that phases can change depending on temperature and pressure. This can have a big effect because 
physical properties can change dramatically in a “phase transition” as you go from one phase to 
another. I will close by showing you a couple of phase transitions. 

’See http://www.astro.ucla.edu/^wright/CMB-DT.html 
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lecture 26 

Topics: 

Before the big bang 
Inflation 

The cosmological principle and the Taylor expansion 

The cosmological constant 

The laws of physics and the Taylor expansion 

Before the big bang 

One reason that the cosmological principle has historically been treated as a sort of philosophical 
assumption is that until recently it was hard to imagine any sort of reasonable physics that could 
give rise to it. The reason that it seems difficult is special relativity. To see the difficulty, let us first 
think about the physics in a region of the universe that lies beyond the boundary of the observable 
universe. By definition, this is a region that we cannot see, because light from this region has not 
had time to reach us since the big bang. But according to special relativity, that means there is no 
way that any information can have been transferred from here to there or vice versa. This makes 
it hard to imagine any physical process that could establish the equivalence of this region of the 
universe with ours. Yet the cosmological principle requires that these two regions be equivalent. 
This does not mean that the cosmological principle is inconsistent. It just means that whatever 
established the cosmological principle did so as a kind of initial condition — before the big bang 
started the Hubble expansion. This makes the cosmological principle sound a bit fishy. 

In fact, the situation is even worse. We don’t really much care about whether the cosmological 
principle holds beyond the observable universe. But consider two regions on opposite sides of the 
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observable universe. 
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boundary of the visible universe 


These are regions that we can see, and what we see is that the microwave background radiation 
from these two regions looks the same to very high accuracy. We can see directly that the cosmo¬ 
logical principle holds for these two regions. But since light has just barely had enough time to 
reach us from these two regions, there is absolutely no way that light (and therefore information) 
can have gotten from one of these regions to the other in the history of the universe as we know 
it. Cosmologists refer this as the “Horizon Problem” because the boundary of the visible universe 
is a ki nd of horizon, beyond which we cannot see. The problem is that the similarity of the CMB 
from different regions of the cosmic horizon looks l ik e a miracle, because no physics could have 
established it without violating the precepts of special relativity. The similarity must have been 
there as an initial condition before the big bang. Someone or something must have decreed that 
these regions looked the same before the conventional Hubble expansion started. What is going on 
here? 


Inflation 

One possible answer was suggested in the 80s by Alan Guth (now at MIT). I will describe the 
orginal version of Alan’s idea, even though it is not exactly the modem view of what happens 
because I think that it is the easiest to explain and is related to some interesting physics in the ev¬ 
eryday world. Alan thought about the effect on cosmology of the funny structure of the relativistic, 
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quantum mechanical vacuum. Like water, the vacuum is really a pretty complicated place. It has 
different phases. It breaks symmetries of the underlying interactions because it has structure that 
allows it to exist in different orientations in some odd quantum mechanical space. We have very 
strong reasons to believe that the vacuum can exist in different phases, each Lorentz invariant, but 
with different energy densities and other properties. At low temperatures, the phase we live in has 
the lowest energy and is stable (we hope). But at the high temperature of the big bang, one of the 
other phases might be the lowest energy state and be stable (like steam is more stable than liquid 
water at high temperatures). Such a state, if it exists, is called a “false vacuum.” 

Suppose that there is some continuous parameter (there may be many) that describes the config¬ 
uration of vacuum. Then one way that multiple phases can happen is that energy of the vacuum as a 
function of this parameter has a couple of local minima, but that the curve depends on temperature. 
So at low temperatures it might look like 



with the stable phase corresponding to the minimum on the right while at high temperatures it 
might look like 



with the stable phase corresponding to the minimum on the left. This is the situation with water 
and steam in the real world. At high temperatures then right after the big bang, the vacuum state of 
the universe would typically be in the high-temperature “false” vacuum, and as the universe cools, 
there would come a time when the two vacuum states had the same energy, and after that, at lower 
temperatures, the true vacuum would be more stable. But in this situation, there is still an energy 
barrier between the two vacua so the transition from the false vacuum to the true vacuum is not 
immediate. The false vacuum is said to be “meta-stable.” What happens then is that either as the 
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result of a seed, or of random quantum mechanical fluctuations in the case of the universe, small 
regions of true vacuum can form and if they are large enough, they realize that they are more stable 
and grow and eventually permeate the whole space. Rain drops and the little bubbles that form 
when you boil water are examples of the same kind of phenomenon. 

But one might think that even if the vacuum were in a different phase in the early stages of 
the big bang, that as the universe expands and cools, the usual vacuum would simply eventually 
reappear when things got cool enough. There is the interesting physics of metastability associated 
with the transition from one vacuum to another. But otherwise, the effect of the false vacuum 
should simply disappear when the universe cools. What Guth and others gradually realized is 
that this situation gets more complicated in the presence of gravity. It is true that eventually this 
false vacuum will decay into ordinary particles moving in the “true” low temperature vacuum with 
the minimum energy. But in the meantime, while the false vacuum fills space, something bizarre 
happens — the space full of false vacuum expands incredibly rapidly! Strange as this behavior is, 
we can actually understand it very simply from the equation for Hubble dynamics that we derived 
last time and used to discuss the evolution of the expansion of the universe. For a flat universe, 1 
this can be written as follows: 



( 1 ) 


where a is any distance between points in space (distance between galaxies, whatever). We derived 
this assuming that p is the mass density of galaxies, but I assured you that it is true more generally 
if p (in units with c = 1) is interpreted as the energy density — basically because any kind of 
energy, not just rest energy (which is massxc 2 ) can produce and feel the force of gravity. 

We have seen how this works if the energy density comes primarily from nonrelativistic matter 
or from radiation. Now lets consider the surprising effect of gravity on a universe in which the 
energy density is dominated by the energy of a “false vacuum.” Here the difference is that as the 
universe expands, you just get more false vacuum, just as you get more true vacuum when the space 
in our present universe expands. But if the energy density of the false vacuum is the dominant 
contribution to p, that means that p does not fall off as a increases — it remains approximately 
constant, determined by the energy density of the false vacuum. 

This sounds a little crazy. It is worth checking it with the relation we derived last time for the 
rate of change of the energy density, 



( 2 ) 


To check this, we need to know the relation between energy density and pressure in the false 
vacuum. Suppose that we have some false vacuum in a cylinder that we can expand with a piston 

1 We will discuss what happens if the universe is not flat shortly. 
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When we pull out the piston a distance x, we make more false vacuum — with volume xA where 
A is the area of the piston. But this costs energy pxA. 



Thus the piston must be pulling back on us with a force pA. But that means that the pressure 
associated with the false vacuum is negative — equal to —p. This weird result makes sense, 
because if we put p = —p into (2), we find that p = 0 — which is just what we expect — the 
energy density of the false vacuum is constant, independent of the volume of space. 

So now that we are convinced that it makes a kind of weird physical sense to have p constant, 
we can go back to ( 1 ), which becomes 

a I~8 ttG~p 

- = \^ L = il (3) 

a V 3 

where p is a constant! This is really easy to solve. 

a — pa =>■ a oc e vt (4) 


The expansion of the space is actually exponential in time!!!! As you know, exponentials can get 
big quickly. This process is called “inflation” — during inflation, while the false vacuum lasts, 
space, and the universe along with it expands by some enormous factor. 

Wait a minute — this is even crazier than the constant p. Isn’t gravity supposed to slow down 
the expansion of the universe? Let’s see by looking at the equation we derived last time for the 
second derivative of a: 


a 

a 


4v tG 


(.P + 3 p) 


( 5 ) 


You see now that if we put in p = —p, the right hand side becomes positive (and you can check 
that it has the right value, p 2 ). This sign change is another effect of the negative pressure. This is 
the crazy physics of inflation. Positive pressure contributes to the slowing down of expansion, like 
energy density (only more so because of the factor of 3). But the false vacuum has NEGATIVE 
pressure, which gives a kind of antigravity effect! It’s weird, but all the equations are consistent. 
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It doesn’t really matter that we have used the flat-space (C = 0) version of the Hubble dynamics 
in (1). If the space is not flat at the beginning of an inflationary interlude, the rapid inflation will 
flatten it out, in the same way that the surface of a balloon gets flatter as we blow it up. Thus 
one consequence of inflation is the prediction that after inflation, when the false vacuum decays 
into normal vacuum and matter, the universe is very flat, so that the density must equal the critical 
density to a very good approximation. This is the theoretical prejudice in favor of flatness that I 
mentioned in the previous lectures. 

The cosmological principle and the Taylor expansion 

How does inflation help us with the Horizon Problem? Let’s think about our two regions on 
opposite sides of the observable universe. In an inflationary universe, these two distant regions 
were actually very close together before inflation, so it is very reasonable that they are surprisingly 
similar. To put this another way, suppose that before inflation, the distance over which there was 
significant variation of the physical properties of the universe was R. During inflation, as the space 
expands, this distance scales up with the general expansion of the space. Thus the distance over 
which there is significant variation, no matter what it was originally, becomes huge. But that means 
that the derivative of any property of the universe (for example, the temperature, T) with respect 
to position becomes very small after inflation. Suppose that the temperature looks like this as a 
function of some position variable before inflation. 
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If the universe inflates by a factor of 10, then the little piece between the dotted lines gets blown 
up to the same size as the original region, so the temperature looks like this. 



If the universe inflates by another factor of 10, this happens again. 


T 

T 


.495 .5 .505 

And so on. I hope that this rings a bell from our discussion long ago of the harmonic oscillator. 
This is the Taylor expansion in action. The temperature, and any other important quantity, if it is 
described by a function T(f) before inflation and doesn’t change during inflation of the space by a 
factor of k except due to the expansion of the space, then after inflation 

T(f ) = T(r/n) (6) 
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So that each derivative brings down a factor of \/k. Because the derivatives are tiny after inflation, 
the function can be approximated by the first constant term in a Taylor expansion. 

T{r) = T{ 0) + • ■ ■ (7) 

Inflation makes all the higher terms tiny. As long as there is only a very tiny variation of these 
quantities over the size of the visible universe, the visible universe looks to us as if it satisfies the 
cosmological principle. In the inflationary universe, it is the exponential expansion and the Taylor 
expansion that establishes the very smooth initial conditions of the big bang. 

I think that inflation probably has something to do with very early history of the observable 
universe but that there are a lot of pieces of the puzzle still missing. There are many confusing 
issues involved in understanding how the inflationary era turns into the conventional big bang. It 
is a pretty wild collection with names like “eternal chaotic inflation.” The idea is that the vacuum 
energy fluctuates because of quantum mechanics, and if, in some region, the vacuum energy is 
large enough, the negative gravity induced by its negative pressure causes it to inflate and grow 
wildly. Because the inflating regions are growing so fast, at any given time most of the volume of 
the universe is inflating. But in a few places, the vacuum energy settles down to its minimum value 
and something like our universe results. Absolutely nuts! Suffice it to say that some of these issues 
are actually being tested by the precise satellite studies of the cosmic microwave background. But 
many others will likely remain mysterious. This is a creation myth after all. We shouldn’t expect 
to understand everything! 

I do think that the general picture of an inflated universe is interesting — even liberating. 
Think for a minute about what this means. We have gotten used to thinking about our universe as 
unimaginably big. But in the inflationary view, our visible universe is actually incredibly tiny. All 
the billions of galaxies whose light has reached us since the beginning come from what was initially 
a tiny spot in a much bigger system, and beyond the edge of our visible universe, the universe goes 
on and on — whether it is infinite or not, it is definitely exponentially larger than what we can 
see. The cosmological principle is no longer needed — instead an approximate uniformity arises 
because of inflation. Every point in our visible universe is equivalent to every other because in 
the cosmic scheme of things, they were once nearly the same point! But we should not assume 
that distant parts of this inflated universe are similar to the little patch we live in. They may be 
very different indeed. Who knows whether even the laws of physics itself are constant over the 
absurdly large size of an inflated universe. Maybe if we could look sufficiently far away in various 
directions, we would find an infinite variety of completely different laws. Some of these might 
give rise to interesting but very different worlds. There is something about this picture that I find 
appealing — but since all these issues are invisible inside the confines of the visible universe, these 
questions are not very scientific. We will come back to this in a few minutes. 

The cosmological constant 

There is one more possibly very important thing to say about the energy density of the vacuum. 
In the discussion above, we have implicitly assumed that the true vacuum, after it settles down 


8 


to its equilibrium value, has zero energy density. But why should this be true? Why should the 
true vacuum, the true lowest energy state of the universe, the state with absolutely nothing in it 
— why should this vacuum have zero energy density. This question is only meaningful in the 
presence of gravity. In Newtonian mechanics, as you know, only differences in energy make a 
difference. We can always redefine the energy by subtracting a constant from everything. But as 
we have seen with the false vacuum, in the presence of gravity, this is not true. The energy density 
and the negative pressure associated with it have a gravitational effect. And no one has ever been 
able to come up with a convincing reason why such an energy density should not be there. In 
fact, it was suggested by Einstein himself, early in the history of general relativity. It is called the 
‘cosmological constant.” 

Einstein originally suggested the cosmological constant for what turned out to be the wrong 
reason. He felt, for philosophical reasons, that the universe should not change with time. He didn’t 
know about the Hubble expansion, which was discovered later. But with ordinary gravity, there is 
no way to make a universe that stands still. If it starts at rest, gravity will cause it to contract. But 
as we have seen, an energy density of the vacuum produces a repulsive gravitational effect because 
of its negative pressure. This can, if properly chosen, exactly cancel the effect of gravity, allowing 
for a stationary universe. 

Of course this original motivation for the cosmological constant has long since gone away. It 
is sometimes referred to as “Einstein’s mistake” because it seems very ugly to theorists like me. 
I’ll come back below to why it seems so ugly. Nevertheless, a decade ago, astronomers published 
evidence that it might not be a mistake after all. By studying type la supernovas 2 in distant galaxies, 
astronomers found evidence that the Hubble expansion is not slowing down as one would expect 
if the energy density of the universe is dominated by any kind of matter. Rather, it looks as if the 
expansion is speeding up! 

One of the the things that astronomers actually see is that the brightness of distant supernovas 
with red shifts between 0.3 and 1 is lower that what would be expected by computing their distance 
using their red shift. This is interpreted to to mean billions of years ago when this light was 
emitted, the relation between red shift and distance was different from what we would expect from 
extrapolating back from what we see now. For a given red shift, the objects were further away then 
than we would expect on the basis of the Hubble constant determined by looking at nearer (and 
thus more recently viewed) objects. Since we have to to farther out to get the same red shift, that 
means that the universe was expanding more slowly then. But that means that the expansion of the 
universe has been speeding up since the light from these supernovas was emitted, billions of years 
ago. 

A picture may help. If the rate of Hubble expansion were constant, we could plot the position 
of a distant galaxy as a function of time and get a straight line. But if the expansion of the universe 

2 These are supernovas that occur when a white dwarf star gobbles up matter and increases in mass beyond the 
point where the pressure from atomic matter that keeps the star from collapsing can compete with the gravity that is 
trying to squeeze it. When gravity finally wins, the result is a huge explosion that at least theoretically should always 
produce about the same amount of energy and should therefore provide a “standard candle” that allows astronomers 
to determine its distance by measuring its brightness. 
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is accelerating, we get something like this (with time going up): 



While this is going on, we are at the origin, just going up along the vertical axis. In fact, we 
expect the expansion of the universe to decelerate initially, when the ordinary energy dominates, 
but eventually to accelerate when the ordinary energy density falls below the constant dark energy 
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of the vacuum, so the picture might look something like this: 



Let’s not worry about this complication for the moment, and just focus on just the last part 
of this trajectory, where things are accelerating. I hope that the picture will make it clear what is 
going on in this experiment. The dotted line represents the light ray reaching us from the distant 
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supernova. 
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Two groups have independently seen this effect. One is a local group (involving our department 
chair Chris Stubbs): http://arxiv.org/abs/astro-ph/9805201, The other is a group from Berkeley: 
http://arXiv.org/abs/astro-ph/9812133, 

If all this is right, it implies that about 70% of the energy density of the universe, which it 
seems is roughly equal to the critical density required for flatness, comes from the energy of the 
vacuum. This is a bizarre, crazy result that I have a lot of trouble believing, for reasons that I will 
come back to. However, it has been around for almost a decade now as the number of supernovas 
on which the estimate is based has grown from a few to hundreds. There is also now considerable 
indirect support for this view from the consistency of the measured anisotropy of the CMBR and 
the gravitational lensing studies of dark matter with a model in which 30% of the energy density 
of the universe is in the form of cold dark matter and the rest is energy density. 

It is still hard to believe, but who knows. 


The laws of physics and the Taylor expansion 

Now finally, I want to return to the question that we discussed at the very beginning of the course. 
Why is F — dp/dtl Why is Newton’s law a formula for acceleration? Of course, we know that 
Newton’s law is not right. It ignores special relativity and quantum mechanics, for example. But 
in fact, our current theories of relativistic quantum mechanics are based on Lagrangians that are 
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really rather straightforward generalizations of those that we use to derive Newton’s law. The 
Lagrangians that we use to describe the world still depend just on coordinates and their first time 
derivatives. The equations of motion are thus still equations for “acceleration.” So in a certain 
sense, Newton has survived the revolutions of special relativity and quantum mechanics, and the 
question still remains an interesting one. I certainly don’t know the answer to this question. But I 
think that it is related to a much deeper question, which may be the central mystery about the way 
the universe works. 

This mystery takes a bit of explaining. We have already talked about the two constants, fi and 
c, that are built into the way the world works. We are used to setting c = 1. But nature is also 
telling us to set h = 1. If we adopt these sensible units (which are sometimes called “particle 
physics units” - you may have read about them in Morin’s book), then all dimensional quantities 
can be related. For example, we can express everything in terms of mass. The properties of our 
world are primarily determined by the masses of the electron and the proton, and a few numbers, 
like the fine structure constant, a — ^ ~ Almost all of the physics of the everyday world 
involves combinations of these basic parameters. But one thing that is different is gravity. Gravity 
is described by the gravitational constant G, which in particle physics units is proportional to 
1 / m pianck where the Planck mass is enormous, about 10 19 times the mass of the proton. G, is very 
very very tiny compared to any quantity with the same units that we might construct out of the 
parameter that describe the rest of our everyday world. 

Why should we care about gravity? Well, aside from the fact that it keeps us from flying off 
into space, there is a theoretical problem associated with gravity. It seems to be impossible to put 
special relativity, quantum mechanics, and gravity together consistently, without changing the rules 
in some way. This suggests that the Planck mass is the basic scale at which really interesting new 
physics, some change of the rules beyond special relativity and quantum mechanics, is happening. 
This is a very dicey argument, for various reasons, 3 but let’s assume that it is right. Then special 
relativity and quantum mechanics, and with them Newton’s law, are just approximations that are 
true for masses much smaller than the Planck mass. 

Now the important point is this. Every dot in the Lagrangian, every derivative with respect to 
time, in particle physics units, has units of mass (because a derivative is one over a distance). But 
if the fundamental scale is the Planck mass, each dot should come generically with a power of 
1/mpianck- Because m P i anck is so huge, the terms with more than the minimum number of dots can 
be ignored. They are there, but their effects are very small, and we don’t see them. 

If this is right, then F = dp/dt is just an approximation, but it is a very good one because 
gravity is so weak. The central mystery, then, is why is gravity so weak? Why is the Planck mass 
so very much larger than all the other masses that we care about in physics? This seems crazy. 
Where does the tiny dimensionless ratio of the proton mass to the Planck mass, about 10” 19 come 
from? Who know? Maybe one of you will figure it out, or maybe you will show that it is not the 

'For example, maybe someone will discover a clever way of doing it. People are trying. Maybe the string theorists 
have already done it, but they have really changed the rules, so this would just be an example of what I am saying. 
Or maybe the rules change again well before the Planck scale, and gravity emerges in some complicated way from 
physics at smaller masses. There are also some interesting ideas of this sort on the market today. 
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right question. 

In closing, let me finally come back once more to the cosmological constant. If the recent 
reports of a nonzero energy density of the vacuum are correct, this is ever crazier that the mystery 
of the proton mass. To see this, lets convert this cosmological constant to particle physics units. 
The statement is that the cosmological constant is about 70% of the critical density. The critical 
density is given by 



where T is the Hubble time, very roughly 10 10 


1 87TGp c 

T2 = 3 

yr. So 


( 8 ) 


Pc ~ 8v tGT 2 (9) 

The 1/G is just mp lanck . One year is about n x 10 7 seconds. The way I remember how to do these 
unit conversions is to convert things first to a funny unit of mass - billion electon volts - GeV - 
which is about the mass of the proton. In these units, one second is about 10 24 GeV” 1 . But the 
Planck mass is even larger, about 10 19 GeV, so 1 second is about 10 43 m FI ! tnck . Thus a year is about 
10 5 ° mp kinck (we can certainly drop the factor of 7r) and the Hubble time is about 10 6 ° mp 1 1 anck . 
Thus 

Pc ~ 10” 120 m 4 lanck (10) 

This is an even smaller dimensional scale than the proton mass. That is why it seems so crazy. 
Where does this tiny number come from? This situation is much worse even than the ratio of 10 19 
between the Planck mass and proton mass. If we had to guess from first principles the energy 
density of the universe, we would guess that the scale is set by the Planck mass, so we would 
guess mpi anck . We missed it by about a factor of 10 12 °. This wasn’t a very good guess! I think it 
is probably the worst guess in the history of mathematical science. What went wrong? Nobody 
knows. But it is suspcious that no only is it very small, but it is not so different from the critical 
density of the universe today. 

There are two kinds of ideas that people have discussed to try to explain away this puzzle. Many 
theorists believe that it has something to do with an odd kind of symmetry - called supersymmetry 

- which would imply an equivalence between matter particles and force particles. If this symmetry 
were exact, the cosmological constant would be zero. Unfortunately, we don’t see this symmetry 

- it is certainly broken in our world - and nobody have found a plausible explanation of how the 
symmetry could be broken, and still force the cosmological constant to be small. 

The other set of ideas is even crazier and goes by the term “Anthropic argument.” This is an 
updated version of an old idea that the laws of physics are what they are because if they were 
different, we wouldn’t be here to study them. The modern version of this goes something like this. 
Suppose that vacuum is a much more complicated thing than I have described to you, and actually 
the structure of the vacuum in which we live depends on many many parameters, and the energy of 
the vacuum is a complicated “landscape” with lots of peaks and basins. A two dimensional verison 
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really does look like a landscape — 



but of course the multidimensional version is much more complicated and hard to visualize. If 
the vacuum looks like this, and quantum mechanics is causing fluctuations in this complicated 
structure, then most regions in the huge chaotic universe are somewhere up on a hill, inflating away 
like mad because of negative gravity, but in some regions the universe settles down into one of the 
basins, producing matter and radiation which dominates the energy density. Then this region stops 
inflating, at least for a while, until the ordinary Hubble expansion of this region of the universe 
dilutes the matter and radiation and the cosmological constant dominates the energy density. Then 
the expansion accelerates at a rate determined by the value of the energy density at the bottom of the 
basin. This happens a huge number of times in different regions of the universe, in a huge number 
of different basins, at random. Each basin will have a different of the cosmological constant. This 
kind of universe is a kind of huge experimental laboratory with different experiments going on each 
of the regions where the universe settles down. One constraint that many people think is relevant 
has to do with the formation of galaxies. If the cosmological constant is too big in a basin, then 
the space continues to grow so fast that matter doesn’t have a chance to fall into separate galaxies. 
Then stars don’t form, planets don’t develop, and nobody gets to teach Physics 16. If you believe 
that this is necessary, it goes a ways towards explaining why the cosmological constant is so small. 

I am not sure that this is science. But one could imagine getting enough information about 
structure of this landscape to make predictions rather than just qualitatively explaining what we 
see. That would be interesting. 

At any rate, it seems clear that we need to understand the vacuum much better. If we are really 
lucky, we will get some hints to what is going on by seeing some surprising property of the vacuum 
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at the LHC in the next few years. 
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Above are a map and an arial photo of the CERN site, near the Geneva airport, where the LHC 
is being constructed. This is a monster project involving many thousand scientists and engineers. 
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Below is a photo in the tunnel - most of the 10 miles of it looks like this. 



It is pure speculation at this point to think that the LHC may teach us something about the dark 
energy. But it could happen. Certainly, the LHC will be our first opportunity to directly probe the 
mysterious physics of the vacuum, of which the dark energy seems to be another manifestation. 
That should make the next ten years an incredibly exciting time for fundamental physics. 
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